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PART  I 


MULTISTATE  COHERENT  STRUCTURE  FUNCTION  THEORY 

In  military  systems,  power  plants,  and  many  other  areas  of  modern 
society,  the  consequences  of  failure  can  be  catastrophic.  The  study  of 
reliability  has  evolved  from  the  desire  to  prevent  failure,  or  at  least 
mitigate  the  consequences  thereof.  Reliability  is  usually  defined  as 
the  probability  that  a  component  or  system  is  able  to  perform  its 
specified  function.  A  reliability  analysis  is  performed  to  determine 
that  probability.  This  analysis  consists  of  mathematically  modelling 
the  system,  assessing  failure  probabilities,  and  generating  a  numerical 
result.  This  thesis  deals  with  aspects  of  the  theory  underlying  the 
mathematical  model. 

Most  reliability  calculations  are  performed  assuming  that  components 
and  systems  are  either  functioning  or  failed.  This  dichotomy  is  often  a 
reasonable  assimption,  but  the  assunption  is  sometimes  made  simply 
because  there  are  no  applicable  results  dealing  with  more  complicated 
state  spaces.  There  are  many  situations  in  which  the  ability  to 
consider  multiple  states  would  be  useful  in  a  reliability  context.  A 
component  or  system  may  have  a  useful  partially  operating  mode.  For 
example,  if  one  of  two  turbines  in  a  power  plant  is  undergoing  repair, 
the  plant  may  be  able  to  generate  5GK  of  its  rated  electric  capacity 
which  is  significantly  better  than  being  completely  shut  down.  It  may 
also  be  useful  to  differentiate  among  different  modes  of  failure.  A 
valve  may  fail  to  open,  fail  closed,  or  fail  ruptured,  and  these  failure 
modes  may  have  very  different  effects  on  system  operation.  The  enlarged 
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state  space  can  be  used  for  actual  quantities  rather  than  just 


qualitative  measures. For  example,  states  0,  1,  100  might  be  water 

temperature  in  degrees  centigrade. 

This  thesis  considers  reliability  in  a  general  multistate  setting. 
The  number  of  states  used  in  describing  the  status  of  each  component  and 
the  operation  of  the  system  is  allowed  to  vary,  and  the  dependence  of 
the  system  state  on  the  component  states  can  be  very  general.  The  first 
part  of  the  thesis  deals  with  the  extension  of  binary  coherent  struc¬ 
ture  function  theory  to  the  multistate  case.  Some  results  in  this  area 
have  previously  been  obtained.  The  existing  results  are  generalized, 
and  several  new  results  are  derived.  The  first  part  of  the  thesis  could 
be  applied  to  maintenance  policies  by  letting  the  component  states  rep¬ 
resent  the  total  available  number  of  a  certain  binary  component  and  let¬ 
ting  the  system  state  represent  the  total  number  of  operating  machines. 
However,  the  second  part  of  the  thesis  is  more  useful  for  determining 
optimal  maintenance  policies  for  multistate  components.  Most  previous 
maintenance  models  deal  with  discrete  time  replacement  policies.  Compo¬ 
nents  which  may  be  repaired  or  replaced  at  any  time  are  considered  in 
this  thesis. 
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1.  INTRODUCTION  AND  HISTORY 


Coherent  structure  theory  is  an  attempt  to  treat  reliability  theory 
in  an  axiomatic  way.  Since  reliability  is  an  engineering  discipline, 
the  axioms  and  definitions  are  intended  to  reasonably  portray  the  opera¬ 
tion  of  systems  and  components  in  the  ‘’real  world".  The  first  systems 
to  be  considered  from  a  reliability  viewpoint  consisted  of  binary  compo¬ 
nents,  components  for  which  one  state  represents  operational  status  and 
the  other  state  indicates  failure.  The  structure  function,  which  repre¬ 
sents  the  state  of  the  system  given  the  states  of  the  components,  was 
usually  assumed  to  be  binary  and,  in  addition,  was  usually  assumed  to  be 
coherent.  Binary  coherence  means  that  each  component  of  the  system  is 
relevant,  i.e. ,  important  in  determining  the  value  of  the  system  struc¬ 
ture  function,  and  that  the  system  state  cannot  decrease  when  the  state 
of  one  of  its  components  increases.  It  seems  reasonable  to  expect  most 
real  systems  to  operate  in  this  fashion. 

Because  many  components  may  partially  function  or  have  several  modes 
of  operation,  there  have  been  attempts  to  generalize  the  theory  to  in¬ 
clude  components  or  systems  with  several  states.  However,  there  are 
many  gaps  in  the  theory  of  multistate  coherent  structures  which  need  to 
be  filled,  and  the  theory  is  not  as  general  as  is  desirable  for  engi¬ 
neering  purposes.  In  particular  the  concepts  of  relevance  and  coherence 
for  components  with  several  states  need  to  be  broadened  to  represent 
useful  ideas  in  the  operation  of  multistate  systems.  The  first  part  of 
this  thesis  is  an  attempt  to  remedy  the  situation.  After  a  review  of 
the  binary  case  and  recent  generalizations  in  this  chapter,  definitions 
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are  given  in  Chapter  2  which  are  meant  to  reconcile  coherent  structure 
function  theory  with  the  "real  world”  operation  of  multistate  compo¬ 
nents.  The  rest  of  Chapter  2  and  all  of  Chapter  3  show  that  most  of  the 
results  in  the  binary  case  have  analogues  in  the  multistate  case.  Block 
diagrams  and  fault  trees  and  their  extension  to  multistate  components 
are  the  subject  of  Chapter  4.  Chapter  S  pertains  to  components  which 
have  a  continuum  of  states  rather  than  multiple  discrete  states. 

1.1.  Binary  Coherence  and  Notation 

Reliability  is  a  relatively  new  field  and  is  primarily  the 
outgrowth  of  the  concern  about  military  hardware  reliability  following 
World  War  II.  The  reliability  literature  of  the  1940's  and  1950' s  is 
primarily  devoted  to  quality  control,  renewal  theory,  and  properties  of 
various  life  distributions.  The  first  paper  to  treat  reliability  from  a 
functional  point-of -view  seems  to  be  Birnbaum,  Esary,  and  Saunders 
(1961].  That  paper  defined  a  binary  coherent  system  and  its  dual  and 
developed  many  of  their  properties  including  minimum  paths  and  cuts. 

The  first  reliability  bounds  for  a  coherent  system  with  independent 
components  were  developed  by  Esary  and  Proschan  [1963],  A  paper  by 
Birnbaum  and  Esary  [1965]  introduced  the  concept  of  modules  and 
developed  some  of  their  basic  properties.  Bodin  (1970]  then  used 
modules  to  obtain  better  bounds  on  system  reliability.  The  idea  of 
associated  random  variables  was  introduced  by  Esary,  Proschan,  and 
Walkup  [1967],  and  Esary  and  Proschan  [1970]  used  that  concept  to  obtain 
reliability  bounds  for  systems  with  dependent  components.  The  concept 
of  reliability  importance  is  due  to  Birnbaum  [1969].  A  complete 
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treatment  of  binary  coherent  structure  functions  and  their  properties 
may  be  found  in  Barlow  and  Proschan  {1975a].  The  notation  of  that  book 
is  used  throughout  this  section. 

In  binary  coherent  structure  function  theory,  the  components  and 
system  can  be  in  one  of  two  states  represented  by  the  numbers  0 
(failed)  and  1  (functioning).  Let  X^  be  the  state  of  the  ith 
component  in  a  system  composed  of  n  components. 


Notation:  X,  =  1  ,  .  , 

-  i  (component  i  is  function) 

11  if  component  i  is  functioning 

0  if  component  i  is  failed 

is  called  the  indicator  function 

—  =  ^Xl*  X2’  *“•  Xn^ 

$(X)  »  the  state  of  the  system  5  1,  _  ,  .  .  , 

—  (system  functioning) 

»  :  <0,1>"  ♦  (0,1) 

*  ♦(X1-X2 . X1-CJ’X1+1 . V 


Definition:  Component  i  is  relevant  if  there  exists  X  such  that 
Kl^X)  -  1  and  tfO^X)  -  0. 

It  will  be  assumed  that  all  components  are  relevant  since  irrelevant 
components  have  no  bearing  on  the  system  state. 

Definition:  A  system  represented  by  $(X)  is  coherent  if 

(1)  ♦(£)  -  0,  4>U)  -  1,  and 

(2)  $(X)  is  increasing  in  X. 
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Note:  Increasing  is  used  to  mean  X^  >  Xj  ^  2.  ♦(J^)  while 

strictly  increasing  will  mean  Xj  >  X  -■»  ^(Xj)  >  7116  8ame 

applies  to  decreasing  and  strictly  decreasing.  X  >  Y  means 
Xj  2  v  1  and  *£  >  Y^  for  some  i. 

Definition:  A  path  vector  is  a  vector  X  such  that  $(X)  ■  1.  The 
corresponding  path  set  is  (i:X^»l).  A  path  vector  X  such  that 

Y  <  X  ♦(Y)  «0  is  called  a  minimal  path  vector  and  the  associated 
path  set  is  called  a  minimal  path  set.  The  jth  minimal  path  set  is 
denoted  P • 

Definition:  A  cut  vector  is  a  vector  X  such  that  $(X)  ■  The 
corresponding  cut  set  is  {i  :  X  ■  0}.  A  cut  vector  X  si  that 

Y  >  X  ♦(Y)  «  1  is  called  a  minimal  cut  vector  and  the  a.  . >ted  cut 
set  is  called  a  minimal  cut  set.  The  jth  minimum  cut  set  i-  denoted 


Minimal  path  sets  and  minimal  cut  sets  are  sometimes  called  min 
paths  and  min  cuts,  respectively. 

Definition:  The  dual  of  ♦  is  denoted  by  and  is  defined  by 

♦D(x)  =  i  -  ♦a-x). 

It  is  easy  to  show  that  m  $.  Minimal  path  sets  for  $ 

are  minimal  cut  sets  for  end  vice  versa.  The  following  are 

classic  examples  of  coherent  systems. 

Example  l.i:  A  series  system  is  one  in  which  every  component  must  func¬ 
tion  in  order  for  the  system  to  function. 
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n 

<KX)  -  n  X.  -  min  (X  )  . 

i-1  i  1 


Each  component  represents  a  minimum  cut  set,  and  the  only  path  vector 
is  J..  □ 

Example  1.2:  A  parallel  system  is  one  in  which  the  system  will  function 
if  any  component  functions. 

n  n 

<KX)  -  1  X  5  1  -  n  (1-X  )  -  max  (X  ) 

i-1  i-1  1  i 


Each  component  is  a  minimum  path  set,  and  the  only  cut  vector  is  0. 

The  dual  of  a  parallel  system  is  a  series  system  and  vice  versa.  0 

Example  1.3:  A  k-out-of-n  system  is  an  n-component  system  that  will 
function  if  k  or  more  of  its  components  function. 


*(X) 


^i-jX^k) 


The  dual  of  a  k-out-of-n  system  is  an  (n-k+l)-out-of-n  system.  A 
series  system  is  an  n-out-of-n  system  and  a  parallel  system  is  a 
1-out-of-n  system.  0 

Let  (C,$)  denote  a  set  of  components  C  and  a  coherent  structure 
function 

Definition:  (A,x)  is  a  module  of  if  A  _c  C  and  $(X)  “ 

A  AC 

<Kx(X  ) ,  X  )  where  <l>  is  also  a  coherent  structure  function. 
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The  notation  X^  means  the  vector  with  elements  X^,  i  c  A,  and  A*2 
means  the  set  complementary  to  A. 

Definition:  A  modular  decomposition  of  a  coherent  system  (C,$)  is  a 
set  of  disjoint  modules  (Aj,Xj)»  ••••  (A^.x^)}  together  with  an 
organizing  structure  such  that 

r 

(1)  ^  Ai  and  A^n  A^  *  (|)  v  i  i*  j,  and 

A  A  A 

(2)  <KX)  -  (KXjCx  M.  X2(x  2),  ...,  xr(x  r))  . 

Example  1,4:  A  trivial  modular  decomposition  is  4KX)  * 

A  A 

♦(Xjfx  ^.....XpCX  r))  where  xCX^  -  X1  vi.  As  a  more 

useful  example,  consider  a  system  composed  of  two  elements  in  series 

followed  by  two  elements  in  parallel  as  in  Figure  1.1. 

*(x)  -  <Kxl(x1,x2),  x2<x3,x4)) 

where 

X1(X1»X2>  -  xLx2 

x2(x3,x4)  -  X3  +  x4  -  X3X4 

<Kxl,x2)  -  xLx2  -  x1x2(x3  +  X4  -  x3x4)  .  □ 
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Figure  1.1  System  Diagram 

The  preceeding  discussion  and  definitions  have  dealt  with  the 
deterministic  aspects  of  coherent  systems.  The  following  discussion 
relates  to  probabilistic  evaluation  of  system  operation  with  the 
component  and  system  states  considered  as  random  variables.  Reliability 
is  defined  as  the  probability  of  successful  operation. 

Notation;  Pi  =  component  reliability  *  P(Xt  *•!)«•  EX^  . 

h  =  system  reliability  -  P(  $(X)  ■  1)  -  E(KX) 
h  **  h(_P)  where  £■  (Pj,  ....  Pn>  if  it  is  assumed 
that  all  components  are  independent. 
h(P)  -  h(P)  when  P  *  P^  *  *  •  *  *  ■  PR 

Components  are  often  subject  to  the  same  loads  and  a  common 
environment  so  that  component  failures  may  be  highly  correlated  rather 
than  independent.  The  following  definition  is  a  type  of  correlation 
useful  in  reliability  theory. 

Definition:  Random  variables  T, ,  ....  T  are  associated  if 
COV(F(T) ,  G(T)J  £0  for  all  pairs  of  increasing  binary  functions  F 
and  G  (when  the  covariance  exists). 
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Many  reliability  bounds  have  been  developed  for  systems  composed  of 
associated  components.  These  bounds  can  usually  be  improved  through 
modular  decomposition. 

An  implicit  assumption  in  the  preceeding  discussion  is  that  time  is 
not  considered  or  is  fixed.  In  practice  each  component  will  have  a 
random  life  length  T  governed  by  a  life  distribution  F^t). 

Notation;  F^t)  H  1-F^t)  -  PU^t)  -  1)  -  P(Tt  >  t) 

F(s 1 1)  =  F(t+s)/F(t)  if  F(t)  >  0 

»  conditional  reliability  for  a  component  of  age  t 

h(F)  =  P(«(X(t)  -  1))  where  F-  (F^t) . Fn(t)) 

.x  ,  ,  F(t+s)-F(t) 

r(t)  »  failure  rate  =  1 im  - 

8+0  F(t) 

■  f(t)/F(t)  when  f(t)  exists  and  F(t)  >  0 
t 

F(t)  -  exp  (-  /  r(s)ds) 

0 

Components  are  sometimes  classified  according  to  their  life  distri¬ 
butions.  Several  categories  of  life  distributions  are  defined  below. 
Useful  reliability  bounds  have  been  developed  for  each  category. 

Definition;  A  distribution  F(t)  or  a  random  variable  (component)  with 
that  life  distribution  is  said  to  be 

-Increasing  Failure  Rate  (IFR)  if  F(s|t)  is  decreasing  in 
t  v  s  >  0. 
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-Decreasing  Failure  Rate  (DFR)  if  F(s|t)  is  Increasing  in 


t  v  s  >  0. 

-Increasing  Failure  Rate  Average  (IFRA)  if  — ( 1 / t )  log[F(t))  is  in¬ 
creasing  in  C. 

-Decreasing  Failure  Rate  Average  (DFRA)  if  — < 1 / t )  log[F(t)l  is  de¬ 
creasing  in  t. 

-New  Better  than  Used  (NBV)  if  F(s+t)  <  F(s)F(t)  *  s  >  0,  t  _>  0. 
-New  Worse  than  Used  (NWU)  if  F(s+t)  >  F(s)F(t)  v  s  >  0,  t  >  0. 

-New  Better  than  Used  in  Expectation  (NBUE)  if  F(X)dX  |iF(t) 
where  p  -  ET  <  ®. 

-New  Worse  than  Used  in  Expectation  (NWUE)  if  F(X)dX  pF(t). 

If  F(t)  has  a  density  f(t),  then  IFR(DFR)  means  that  the  failure  rate 
is  increasing  (decreasing)  in  t,  and  IFRA(DFRA)  means  that  /*  r(s)ds 
is  increasing  (decreasing)  in  t.  It  can  be  shown  that  IFR  IFRA  •=¥ 
NBU  and  that  DFR  DFRA  NWU.  It  can  also  be  shown  that  IFRA  and  NBU 
distributions  are  closed  under  the  formation  of  coherent  systems,  that 
IFR,  IFRA,  and  NBU  distributions  are  closed  under  convolutions,  and  that 
DFR,  DFRA,  and  NWU  distributions  are  closed  under  mixtures. 

1 .2  Previous  Generalizations 

The  first  attempts  to  treat  multistate  reliability  from  a 
functional  viewpoint  were  entitled  cannibalization.  Cannibalization  is 
the  use  of  parts  from  several  failed  units  to  form  operational  equip¬ 
ment.  If  one  aircraft  has  a  damaged  tire  and  one  has  an  inoperative 
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radio,  a  alaple  transfer  allows  the  fleet  to  have  an  operational  air¬ 
craft.  The  first  article  on  this  subject  was  Hirsch,  Heisner,  and  Boll 
[1968].  This  paper  treated  binary  components  and  a  multistate  structure 
function  with  restrictions  similar  to  coherence.  Each  component  is 
allowed  to  be  used  in  several  locations.  A  cannibalization  operation  is 
the  transfer  of  some  or  all  components  to  different  positions  within  the 
system,  providing  an  increase  in  the  system  state.  Conditions  are 
derived  under  which  the  state  of  the  system  can  be  determined  from  the 
number  of  each  type  of  component  available.  Although  different 
terminology  is  used,  ideas  such  as  cut  sets  and  k-out-of-n  structures 
are  described.  Hochberg  [1973]  extended  these  results  from  systems  with 
binary  components  to  systems  with  multistate  components.  Simon  [1972] 
obtains  bounds  on  P($(X(t))  >_  j )  in  cannibalized  systems.  This  is 
accomplished  by  placing  restrictions  on  the  cannibalization  operations. 

The  first  extension  of  coherent  structure  function  theory  to 
continuous  components  is  contained  in  Postelnicu  [1970],  In  this  paper 
the  component  and  system  states  are  any  values  in  the  unit  interval. 

The  structure  function  used  is  closely  related  to  coherent  structure 
functions.  Two  reliability  bounds  were  obtained  in  the  paper.  A  second 
paper  dealing  with  a  continuous  state  space  was  Ross  [1979].  He  defines 
an  1FRA  process  and  an  NBU  process.  These  definitions  are  used  to 
generalize  the  IFRA  and  NBU  closure  theorems  (closure  under  the 
formation  of  coherent  systems). 

The  first  paper  intended  primarily  to  generalize  coherent  structure 
function  theory  was  Barlow  and  Wu  [1978].  In  this  paper  a  system  is 
modelled  as  though  it  were  binary,  and  the  minimum  path  sets  and  minimum 


cut  sets  ate  determined.  The  components  and  structure  function  are  then 
allowed  to  be  any  integer  among  (0,l,...,M).  The  system  state  is 
defined  as  the  state  of  the  worst  component  in  the  best  min  path  which 
is  the  same  as  the  state  of  the  best  component  in  the  worst  min  cut, 
i  «e . 


4>(X)  -  Max  Min  (X.)  -  Min  Max  (X.) 
j  iePj  1  j  i€Ki  i 


where  is  the  jth  min  path  and  is  the  jth  min  cut.  This  is 

very  restrictive  since  the  relationship  between  the  system  and 
components  is  not  allowed  to  vary  as  the  system  level  varies. 

Reliability  and  stochastic  system  performance  are  considered,  and  a 
variant  of  the  IFRA  closure  theorem  is  proved. 

Another  article  containing  a  restrictive  generalization  of  coherence 
is  El-Neweihi,  Proschan,  and  Sethuraman  [1978].  The  system  and 
components  may  be  any  integer  in  (0,1,..., M),  and  $  is  coherent  if 


(1) 

$(X)  is  increasing  in  X, 

(2) 

there  exists  X 

such  that 

♦(j  ±  ,x)  -  j  while  H^.x)  i  j 

v  component  i 

and  system 

level  j,  and 

(3) 

$(j_)  »  j  where 

J  (  j  ^  *  J  2 1 

Minimum  path  sets,  minimum  cut  sets,  and  utility  functions  are  discussed 
in  the  paper.  It  is  shown  that  EU($(X))  is  stochastically  increasing 
in  X  where  U( •)  is  a  utility  function.  Some  reliability  bounds  are 
given,  and  the  NBU  closure  theorem  is  generalized  using  a  different 
definition  for  NBU  process  than  the  one  in  Ross  [1979]. 
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A  less  restrictive  generalization  of  coherence  may  be  found  in  a 
discussion  of  ternary  (3-state)  systems  by  Butler  [1979].  Component  i 


is  defined  to  be  relevant  if  there  exists  X  such  that 
4(2^, X)  t  *0  ,X>.  Component  i  is  fully  relevant  if  there  exists 
X  such  that  $(2^,X)  J*  ♦O^X)  and  there  exists  Y  such  that 
♦(1^,Y)  j*  ♦(O^.Y).  Fully  relevant  means  that  every  state  of  every 
component  is  relevant.  It  is  assumed  that  all  components  are  relevant. 

0  is  coherent  if 

(1)  *(0)  -  0,  $<2)  -  2,  and 

(2)  <KX)  is  increasing  in  X. 

Using  this  definition  of  coherence,  Butler  extends  the  ideas  of 
reliability  importance  and  structural  importance,  including  new 
importance  measures  introduced  in  the  paper,  to  multistate  systems.  In 
a  second  paper,  Butler  [1982]  extended  several  bounds  on  system 
reliability  to  the  multistate  case.  Another  paper  using  this  definition 
of  coherence  in  a  ternary  system  is  Hatoyama  [1979],  although  it  is  not 
explicitly  stated  that  $(0)  *  0  and  »  1.  Duals,  path  sets,  and 

cut  sets  are  defined  in  this  paper,  and  some  bounds  on  system 
reliability  are  derived.  Most  of  the  paper  deals  with  a  specialized 
type  of  system  composed  of  modules  of  series  or  parallel  elements  with  a 
series  organizing  structure. 

A  discussion  of  several  possible  generalizations  of  coherent 
structure  functions  may  be  found  in  Griffith  [1980].  A  function 
♦  :  (0,1,..., M}n  ♦  {0,1,..., M}  is  called  a  multistate  monotone  system 


(1)  $(X)  is  increasing  in  X,  and 

(2)  min  (X.)  <  *(X)  <  max  (X.). 

i  i  -  -  -  i  i 

If  «KX)  is  a  MMS,  it  is  called 

(A)  Strongly  coherent  if  there  exists  X  such  that  ^(j^.X)  -  j 
and  $(Jl^,X)  /  j  *  l  +  j ,  v  component  i,  and  *  state  j. 

(B)  Coherent  if  there  exists  JC  such  that  $((j-l)^,X)  <  ♦(j^.X) 

v  component  i  and  *  state  j  1. 

(C)  Weakly  coherent  if  there  is  X  such  that  $(0^,X)  <  ♦(M^,X) 

v  component  i. 

Note  that  conditions  (1),  (2),  and  (A)  correspond  to  coherence  in 
El-Neweihi,  Proschan,  and  Sethuraman  [1978],  If  condition  (2)  is 

weakened  to  $(0)  a  0  and  i(2)  ■  2,  then  conditions  (1),  (2),  and  (C) 

correspond  to  coherence  in  Butler  (19791.  The  difference  between 
relevant  and  fully  relevant  in  that  article  is  the  same  as  the 
difference  between  conditions  (B)  and  (C).  Griffith  defines  the  dual  of 
a  MMS,  4>^,  and  shows  that  it  possesses  the  same  type  of  coherence  as 
4>.  He  also  discusses  modules  and  utility  functions  in  the  multistate 
setting.  Reliability  importance  is  defined,  and  it  is  shown  that  system 
utility  can  be  expressed  as  the  product  of  the  reliability  importance  of 
a  component  and  the  probability  vector  of  the  component. 

The  analysis  of  specialized  oultistate  systems  may  be  found  in 
Fardls  and  Cornell  (1981).  This  paper  considers  systems  for  which  there 
are  modules  of  completely  interch£ngable  components  in  series  or 
parallel.  Truth  tables  for  these  types  of  systems  are  analyzed. 
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2.  THE  MULTISTATE  MODEL 


In  binary  coherent  structure  function  theory  and  in  previous  exten¬ 
sions  to  the  oultistate  case,  each  component  and  the  system  structure 
function  were  ail  assumed  to  have  the  same  mraber  of  states,  labelled 
0,  l,  ....  M.  When  a  component  or  the  structure  function  had  only 
J  <  M  +  l  natural  distinct  states,  more  states  were  added  by  making  the 
last  states  J  +  1,  J  +  2,  ....  M  identical  to  state  J.  This  could 
significantly  enlarge  the  state  space  which  is  computationally  ineffi¬ 
cient.  In  addition,  some  theoretical  results  are  unnecessarily  weakened 
as  discussed  later.  This  enlargement  of  the  state  space  is  herein  elim¬ 
inated  by  allowing  every  component  and  the  system  structure  function  to 
have  a  different  number  of  states.  This  means  that  every  state  of  every 
component  will  be  relevant  for  determination  of  the  system  state  since 
component  states  that  are  not  relevant  are  eliminated. 

In  Section  2.2.1  it  is  shown  that  previous  definitions  of  coherence 
lead  to  a  situation  in  which  every  system  can  be  modelled  as  a  coherent 
system,  rendering  the  concept  meaningless.  This  problem  arises  because 
the  states  of  the  structure  function  do  not  have  to  be  monotonlcally  or¬ 
dered  by  increasing  utility.  A  new  definition  of  coherence  incorpora¬ 
ting  a  utility  function  is  proposed,  and  various  types  of  coherence  are 
discussed.  This  definition  requires  that  every  state  of  every  component 
be  relevant.  Previously,  it  was  only  possible  to  require  that  every 
component  be  relevant  (called  weak  coherence)  because  of  the  aforemen¬ 
tioned  enlargement  of  the  state  space.  Using  coherence  rather  than  weak 
coherence  strengthens  some  of  the  results  in  Chapters  2  and  3.  The  con¬ 
cepts  of  minimum  path  sets,  minimum  cut  sets,  series,  parallel,  and 
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k-out-of-n  have  been  redefined  in  terms  of  the  value  of  the  structure 
function.  Section  2.2  presents  definitions  for  terms  such  as  minimum 
cut  set  at  level  j  and  series  system  at  level  k. 

The  dual  of  a  structure  function  is  defined  in  Section  2.2.4,  and  it 
is  shewn  that  the  dual  possesses  the  same  type  of  coherence  as  the  ori¬ 
ginal  structure  function.  Rather  than  dealing  separately  with  each  com¬ 
ponent  in  a  coherent  system,  it  is  often  easier  to  consider  subsets  of 
components  called  modules  together  with  an  organizing  structure  for  the 
modules.  The  main  result  of  Section  2.2.5  is  the  relationships  that 
exist  between  the  type  of  coherence  possessed  by  the  overall  system  and 
the  various  types  of  coherence  associated  with  the  modules  and  their 
organizing  structure.  The  general  model  discussed  herein  has  an  advan¬ 
tage  in  this  modular  decomposition  since  it  turns  out  that  coherence  of 
the  modules  and  their  organizing  structure  permits  a  stronger  conclusion 
than  is  possible  with  weak  coherence  assumptions. 

2.1.  Description  of  the  Model 

Consider  a  system  composed  of  n  components,  and  let 
€  {0,1,...,N  }  be  the  state  of  the  ith  component.  Let 

<KX)  £  {0,1,..., M}  be  the  state  of  the  system. 

4>  :  (0,1 . Nj }  x  {o,l . N2>  x  ...  x  {0,1,...  .N^ }  -*■  {0,1,...,M}  . 

The  total  number  of  vectors  X  is  (N,+l) (N„+l)  •••  (N  +1).  In 
probabilistic  evaluations,  the  component  states  and  the  system  state 
become  random  variables. 
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Notation : 


P(Xi  -  j) 


‘ij 


p(Xi  > 


j) 


I1 


'ik 


-i  (pi0’ ‘ '  ,PiNi)  -S-i  =  (qi0’ ,qiNi) 


—  — 

-  — 

pi 

% 

• 

• 

q 

c 

• 

• 

p 

q 

— n 

—  — 

—  — 

Note  that  q  =  1  v  i.  J?  is  not  really  a  matrix  since  its  ith  row 
has  length  +  1,  and  thus  its  row  length  may  vary.  However,  it  will 
be  called  a  probability  matrix  for  lack  of  a  better  term.  Reliability 
can  no  longer  be  defined  as  the  probability  that  the  system  functions 
since  the  system  may  operate  at  one  of  several  levels. 

Definition:  Reliability  at  level  k  is  hk  =  P(  $(X)  _>  k).  When  the 

components  are  independent,  this  probability  depends  only  on  £  and  is 
denoted  h  (_P) .  When  the  system  level  is  clear  or  when  the  discussion 
applies  to  any  level  k,  the  superscript  will  sometimes  be  dropped,  and 
reliability  at  any  level  k  will  be  denoted  h  or  h(_P) . 


When  time  is  a  variable  in  reliability  calculations,  it  is  assumed 
that  each  component  i  begins  in  state  N^  at  t  -  0  and  has  a 
distribution  p|(t)  which  represents  the  time  until  the  component 
state  drops  to  or  below  a  level  j. 
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Notation:  F^t)  =  1-F±( t)  -  Ptt^t)  >  j)  -  P(Xi(t)  >  j+1) 

0  N1 

£t  =  (F”(t) . Fi1(t))  F  = 

hk(F)  i  P(d>(X(t))  >  k)  . 

Again  £  is  a  matrix  possibly  with  rows  of  unequal  length.  The 

It 

reliability  at  level  k  may  be  denoted  h  (£(t))  when  it  is  desirable 
to  explicitly  show  the  time  dependence,  and  the  superscript  may  be 
dropped  when  it  is  unnecessary. 

2.2  Definitions 

2.2.1  Coherence 

The  definition  of  binary  coherent  structure  function  includes  two 
main  premises: 

(1)  All  components  must  be  relevant,  and 

(2)  An  increase  in  the  state  of  a  component  cannot  cause  a  decrease 
in  the  state  of  the  system. 

These  premises  seem  very  reasonable,  so  reasonable,  in  fact,  that  it  is 
difficult  to  think  of  a  situation  in  which  they  would  not  hold.  The 
first  premise  -  all  components  relevant  -  will  always  hold  since  if  it 
did  not,  the  irrelevant  components  could  be  disregarded  and  a  new 
structure  formed  with  only  the  relevant  components.  However,  the  second 
premise  may  fall  to  hold  because  of  problems  in  defining  success  and 
failure.  If  two  incoming  water  lines  feed  a  pipe  in  which  the  desired 
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o 

water  temperature  is  50  F.,  then  success  for  each  incoming  water  line 

might  be  defined  as  being  at  50°  F.  (with  some  tolerance).  If  the 

o  c 

water  in  one  line  is  60  F.  and  the  water  in  the  other  line  is  AO  F., 

then  the  system  might  be  considered  a  success  even  though  both  incoming 

water  lines  are  failed.  Furthermore,  If  the  first  water  line  is 

repaired,  i.e.  ,  the  water  temperature  is  lowered  from  60°  F.  to  50° 

F.,  then  the  system  will  fail.  In  fault  trees  there  is  a  concept  called 

an  "exclusive  or  gate"  which  mea.'s  that  one  of  two  inputs  but  not  both 

is  necessary  for  success.  In  structure  function  notation,  this  means 

that  $(0,0)  =  $(1,1)  =  0  and  $(1,0)  *  1,  which  does  not  fit  the 

definition  of  coherence.  These  examples  of  non-coherent  systems  are 

rather  contrived,  however,  and  an  engineer  would  never  expect  to  find  a 

non-coherent  system  in  practice. 

It  seems  a  relatively  easy  matter  to  extend  binary  coherence  to 

multistate  systems.  The  following  definition  is  based  on  the  second 

premise  of  coherence  discussed  in  the  previous  paragraph. 

Definition :  $(X)  is  a  Monotone  Structure  Function  (MSF)  if: 

(1)  $(0)  =  0,  $( N)  =  M  where  N  =  (N,,N,,...,N  ),  and 

—  —  —  tin 

(2)  $(X)  is  increasing  in  X* 

The  concept  of  relevance  may  be  used  in  a  multistate  setting  to  mean 
either  that  every  component  is  relevant  or  that  every  state  of  every 
component  is  relevant.  Following  Griffith  [1980],  the  following 
definition  is  presented. 


Definition :  A  component  1  Is  said  to  be 

(A)  Relevant  If  there  exists  X.  such  that  >  <!>(0^,JO. 

(B)  Fully  Relevant  if  »  j  «  1,  2,  N^ ,  there  exists  X  such 

that  <t»(j1,X)  >  ♦((j-l)1,X). 

Clearly,  a  component  which  is  fully  relevant  is  also  revelant  but  not 
vice  versa. 

A  reasonable  definition  of  coherence  would  seem  to  be  a  MSF  with  all 
components  either  relevant  or  fully  relevant.  There  is  a  fly  in  the 
ointment,  however,  as  shown  in  the  following  proposition. 

Proposition :  All  multistate  systems  may  be  modelled  by  a  structure 

function  which  has  properties  (1),  (2),  and  (B). 

Proof :  The  proof  is  constructive.  Assume  that  each  of  n  components 

has  a  natural  state  space  {0,l,...,N  }.  Arbitrarily  assign 

♦(0)  =  0,  ♦(ll,0)  =  1,  4>(1,,0)  =  2 . ♦(ln,0)  =  n, 

♦(lplj.O)  =  n+l . *U)  =  2n-l,  0(2 1(0)  =  2n . 

n 

<t>(N)  =  H  (N+l)  -  1  =  M 
i  =  l  1 

By  construction,  0(0)  =  0,  0(N)  =  M,  0(X)  is  increasing  in  X,  and 
0(ji,X)  >  0((j"l)i,  X)  v  j , i , X.  3 

Thus,  using  (1),  (2),  and  (A)  or  (B)  as  a  definition  of  coherence  means 
that  every  system  is  coherent.  Some  people  might  view  this  as 
desirable,  but  it  would  be  nice  if  there  was  a  feature  which  separated 


coherent  and  non-coherent  systems. 


Implicit  in  the  use  of  binary  coherence  is  that  state  l  is  desirable 
(success)  while  state  0  is  undesirable  (failure).  This  notion  has  not 
yet  been  incorporated  into  multistate  coherence.  In  the  proof  of  the 
previous  proposition,  4>(2^,^))  =  2n  >  $0)  =  2n-l,  but  it  is  very 
possible  that  U(4>(2ltO))  <  U($>U))  where  U(  •)  is  a  utility 
function.  Normally,  an  engineer  would  specify  system  states  in  order  of 
increasing  utility  rather  than  constructing  a  structure  function  as  in 
the  preceding  proposition.  The  conditions  (1),  (2),  and  (B)  would  then 
be  tested  in  the  framework  of  the  specified  structure  function.  Thus, 
although  a  system  can  always  be  assigned  a  structure  function  which 
makes  the  system  coherent,  a  more  natural  structure  function  with  states 
monotonically  ordered  by  increasing  utility  may  be  non-coheren t.  This 
leads  to  the  following  definition  of  coherence. 


Definition:  Let  $(X)  be  a  MSF  and  let  U(i)»(X))  be  the  corresponding 
utility  function  that  assigns  utility  a  to  state  j.  d>(JC )  is 
coherent  if 

(1)  every  component  is  fully  relevant,  and 

(2)  aj  >  aj_1  v  j  =  1,  2,  ....  M. 

The  first  part  of  the  definition  can  always  be  satisfied  by  eliminating 
irrelevant  states.  The  second  part  of  the  definition  means  that  the 


operational  value  of  the  system  increases  as  the  system  state 
increases.  Obviously,  the  utility  function  must  reflect  the  true  value 
of  each  system  state  since  otherwise  setting  U($(X))  =  $(X)  would 
return  the  problem  to  its  original  status.  The  definition  also  implies 


that  Irrelevant  system  states  will  be  eliminated, 
define  and  a'  by 


a.  ,  for 
J-l 


-22- 


1 


f'(X) 


|  $(X)  whenever  4>(X)  j  -  1 
I  #(X)  -  1  whenever  4>()p  j 


i  whenever  i  <_  j  —  1 
(  whenever  i  ^  j 


Thus,  every  state  of  the  system,  as  well  as  every  state  of  every  compo¬ 
nent,  will  be  relevant. 

If  the  components  and  system  are  required  to  have  the  same  state 
space,  say  {0,1,..., M},  it  may  not  be  possible  for  each  component  to  be 
fully  relevant.  This  is  the  situation  in  the  papers  reviewed  in  Section 
1.2.  To  allow  the  situation  described  herein  to  be  compared  with  pre¬ 
vious  literature,  the  following  two  definitions  are  presented  (using 
terminology  from  Griffith  [1980]). 


Definition:  Let  4>(X)  be  a  MSF ,  and  let  U($(X))  be  the  corresponding 
utility  function  that  assigns  utility  a^  to  state  j. 

<>(X)  is  weakly  coherent  if 

(1)  every  component  is  relevant,  and 

(2)  a,  >  v  j  =  1,  •••.  M. 

$(X)  is  strongly  coherent  if 

(1)  v  component  i  and  state  j,  there  exists  X  such  that 
<KJ1,X)  -  j  while  (jO^X)  J4  j  for  i  #  j,  and 

(2)  a^  *  j  *  1. 


It  is  also  assumed  in  Griffith  [1980]  and  El-Neweihi,  Proschan,  and 
Sethuraraan  [1978]  that  <K^)  =  j,  but  this  restriction  is  not  necessary 
for  any  of  the  results  in  those  papers  or  the  results  herein. 


It  is  clear  that  strong  coherence  coherence  weak  coherence. 
Examples  2.1  and  2.2  are  presented  to  show  that  the  reverse 
Implications  are  not  true. 

Example  2,1:  Consider  two  ternary  components  each  with  state  space 
{0,1,2},  and  let  a^  =*  j  -v  j .  Let 

4>(0 , 0)  =  0, 

(1,0)  =  >(0,1)  =  >(1,1)  =  ‘K0 , 2)  -  >(2,0)  =  l,  and 
XI, 2)  =  S(2,l)  =  >(2,2)  =  2. 

Since  #(1,0)  >  #(0,0)  and  #(2,1)  >  >(1,1),  component  1  is  fully 
relevant.  Since  the  system  state  is  symmetric  with  respect  to  the 
components,  component  2  is  also  fully  relevant,  and  the  system  is 
coherent.  To  show  that  #  is  not  strongly  coherent,  consider  #(1,X2). 

X2  *  0  ==>  <t>(l,X2)  =  #(2,X7)  =  1 

x2  =  1  ==)>  <f(0,x2)  =  #0  ,x2)  =  l 

X2  =  2  =*  #(l,X2)  =  4)(2  ,X2)  =  2  . 

Thus,  there  is  no  X2  such  that  #(1,X2)  =  1  and  #(jj,X2)  t  1  for 
j  =  0,  2,  so  part  (l)  in  the  definition  of  strong  coherence  is  not 
satisfied.  □ 

Example  2,2:  Again  consider  two  ternary  components  with  a^  =  j . 

Let  #(0,0)  -  #(1,0)  =  0, 

#(0,1)  -  #(1,1)  -  1,  and 
#(0,2)  -  #(2,0)  -  #(2,1)  -  #(1,2)  =  #(2,2)  -  2. 

Since  #(2,0)  >  #(0,0)  and  #(0,2)  >  #(0,0),  #  is  weakly  coherent. 
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However,  ♦(0,X2)  “  ^(l.X^)  v  X2,  so  X^  is  not  fully  relevant,  and 
the  system  is  not  coherent.  Note  that  we  can  combine  states  0  and  l 
of  component  1  to  get  a  coherent  structure  ♦'  which  is  equivalent  to 
♦  as  follows.  Let  X^  »  max  (X^-1,0),  and  define  ♦'  by 

♦  ’(O'.O)  =«  0  ,  ♦,<0,,1>  -  1,  and 

♦'(O', 2)  =  ♦•<1’,0)  -  ♦,(1,,1>  =  ♦•( 1* ,2)  -  2  . 

This  is  a  special  case  of  Theorem  2.1.  D 

Theorem  2.1:  A  relevant  component  can  be  made  fully  relevant. 

Proof :  Assume  ♦(N^.X)  >  it(0  ^  ,X)  for  some  X  and  4>(j  ,X) 

=  ♦((j-1).,  X)  -v  X.  Define  ♦’  and  X’  by 

(♦(k  ,X)  for  k’  <  j  -  1 
(♦((k+DpX)  for  k’  >  j 

|VxUJ-‘ 

XI  - 

(xri  vx;>j  . 

The  new  structure  function  ♦’  is  the  same  as  ♦  except  states  j  and 
j  -  1  of  component  1  have  been  combined.  Repeat  the  process  if  any 
components  are  still  not  fully  relevant.  0 

Theorem  2.1  shows  that  weak  coherence  is  a  useful  concept  only  when  it 
is  desirable  to  have  the  same  state  space  for  all  components. 

For  simplicity,  throughout  the  remainder  of  this  thesis,  it  is 
assumed  that  all  components  are  relevant  if  all  components  are  required 
to  have  the  same  number  of  states  and  fully  relevant  otherwise.  The 


-25- 


1 


results  contained  herein  will  often  hold  assuming  just  that  $  is  a  MSF 
which  need  not  be  coherent. 

2.2.2.  Series,  Parallel,  and  k-out-of-n 

The  definitions  of  series,  parallel,  and  k-out-of-n  systems  are 
straightforward  generalizations  of  their  binary  counterparts. 

Definition;  The  system  represented  by  a  MSF  $  is 

(1)  Series  if  4>(X )  =»  min  (X.) 

-  i  1 

(2)  Parallel  if  $(X)  -  max  (X,),  and 

-  -  i  i 

n 

(3)  k-out-of-n  if  $(  X)  »  max  {j  :  J  1  ^  >  k} 

i-l  '■  Xi-J  ' 

Clearly,  a  series  system  is  an  n-out-of-n  system,  and  a  parallel 

system  is  a  1-out-of-n  system.  Also,  M  *  min  (  V  )  for  a  series 

i  i 

system,  and  M  =  max  (N^)  for  a  parallel  system.  These  types  of 
systems  are  useful  since  the  position  of  the  component  within  a  system 
is  irrelevant;  only  the  numerical  value  of  the  component  state  has  an 
impact.  When  this  occurs,  $  is  said  to  have  interchangeable  compo¬ 
nents. 

Exaaple  2.3:  Let  $  be  a  ternary  MSF  composed  of  3  interchangeable 
ternary  components, 

(A)  $  is  series  if:  $(0,0,0)  «  $(1,0,0)  -  (1,1,0)  -  $(2,0,0) 

-  $(2,1,0)  -  $(2,2,0)  -  0 
$(1,1,1)  -  $(2,1,1)  -  $(2,2,1)  -  1 
$(2,2,2)  -  2  . 
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(B)  $  is  parallel  if:  $(0,0,0)  ■  0 

$(1,0,0)  -  $(1,1,0)  -  $(1,1,1)  -  1 
$(2,0,0)  -  $(2,1,0)  -  $(2,1,1)  -  $(2,2,0) 

-  $(2,2,1)  -  $(2,2,2)  -  2 

(C)  $  is  2-ouf-of-3  if:  $(0,0,0)  -  $(1,0,0)  -  $(2,0,0)  -  0 

$(1,1,0)  -  $(1,1,1)  -  $(2,1,0) 

-  $(2,1,1)  -  1 

$(2,2,0)  -  $(2,2,1)  -  >  (2,2,2)  -  2  .0 

Theorem  2.2:  Let  $  be  a  MSF . 

(i)  $(X  v  Y )  >  $(X)  v  $( Y)  where 

X  v  Y  -  (max  (X,,Y,),  ....  max  (X  ,Y  )),  and 

—  —  11  n  n 

(ii)  $(X  a  y)  <  $(X)  a  $( Y)  where 

X  a  Y  -  (min  (X  ,Y  ) ,  ....  min  (X  ,Y  )). 

—  —  11  n  n 

If  $  is  a  coherent  MSF  with  M  -  »  •••  •  N^,  then  equality  in  (i) 

<->*  $  is  a  parallel  structure,  and  equality  in  (ii)  <“*  $  is  a  series 
structure. 

Proof:  (i)  By  definition  IvlH  and  X  v  Y  >_  Y.  Since  $(X)  is 
increasing  in  X,  $(X  v  Y)  >  $(X)  and  $(X  v  Y)  >  $(Y).  Thus, 

$(X  v  Y)  >  $(X)  v  $(Y) . 

Assume  that  $(X)  ”  max  (X^  (parallel).  Then  $(X  v  Y)  -  max  (X^  v  Y^) 

■  (max  (X. ) 1  v  (max  (Y.)]  -  $(X)  v  $(Y).  Now  assusme  that  $  is 

i  1  i  1  " 

coherent  and  that  $(X  v  Y)  »  $(X)  v  $(Y).  For  each  i,  there  exists  X 
such  that  $((J-1)1.X)  <  $(J ^ »ii) • 
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4(ji,X)  -  ♦((ji,0)  V  (Oi,X))  -  4(^,0)  V  $(OitX) 

-  ♦(ji,0)  V  [^(0i,(Xk>k,0)  v  ♦(01,Ok,X)J 

.  ...  »  4(j  ,0)  V  [max  0((X  )  ,0))  . 

i  -  k*i  k  k  - 

Similarly,  4((j-l).,X)  <*  4((j"l)<,0)  v  [max  4((X.),  ,0]  . 

1  1  kfi  K  K 

Since  4(  (j  ~  1 )  .^  ,_X)  ^  4(j^.X)»  the  above  equations  imply  that 

*((j-D1,0)  <  <Kj1,0)  *  j  or  0  =  4(0)  <  <K1.,0)  <  •••  <  ♦(Ni,0)  <  M. 

If  M=N^  =*•••  =  N^,  this  means  that  4(j^,())  =  j  *f  i  and  j  or 

<t>(X)  ■  max  (X.) 

~  i  1 

The  proof  of  (ii)  is  similar.  0 

Example  2.4:  To  show  that  M=N,  »•••=*  N  is  a  necessary  part  of 
- - -  l  n 

the  proof,  let  XJ  be  binary  and  X^  be  ternary.  Let  4(0,0)  ■  0, 

4(0,1)  *■  1,  and  HO, 2)  <■  4(1,0)  *  4(1,2)  =  4(1,2)  ■  2.  It  is  easy  to 

show  that  4  is  a  coherent  MSF  and  4(X  v  Y)  «  4(X)  v  4(Y) ,  but 

4(X)  /  max  (X.).  To  show  that  weak  coherence  does  not  suffice  to 
i  1 

prove  the  theorem,  make  component  l  ternary  with  4(2,0)  «  4(2,1)  = 

4(2,2)  ■  2.  Then  4  is  weakly  coherent,  but  4(1.0)  ■*  2  t  max  (X  )  .0 

i  1 

These  concepts  may  be  applied  to  a  single  system  state  rather  than 
the  entire  system. 

Definition:  The  system  represented  by  a  MSF  4  is 
(1)  Series  at  level  j  if  4(X)  -  j  <— »  min  (X^  «  j. 
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(2)  Parallel  at  level  j  if  $(X)  -  j  <«*  max  (X^)  »  j. 

(3)  k-out-of-n  at  level  j  if  $(X)  <*»  max  {1:  I  -  j. 

A  series  system  is  a  series  system  at  level  j  *  j .  Note  that  if 
6  is  k-out-of-n  at  level  j,  it  must  be  at  least  k-out-of-n  for  all 
system  levels  larger  than  j.  To  see  this,  let  $  be  k-out-of-n  at  level 
j  and  (k-l)-out-of-n  at  level  j  +  1.  Then  $(  (j+1) ^ , . . • ,( j+l)k_j »£>  <  3 
by  the  first  criterion,  but  i>((j+l)^ .  ••  •  »(j+l)^_^,£)  j  +  1  by  the 
second  criterion  which  is  clearly  impossible. 

Example  2.5:  Let  $  be  a  MSF  composed  of  3  interchangeable  components 
with  M  *  Nj  *  «  3.  We  construct  $  to  be  parallel  at  level 

1,  2-out-of-3  at  level  2,  and  series  at  level  3  as  follows. 

♦(0,0,0)  -  0  $(3,3,3)  -  3 

$00  -  1  for  X  €  {(1,0,0), (1,1,0), (1,1,1), 

(2, 0,0), (2, 1,0), (2, 1,1), 

(3, 0,0), (3, 1,0), (3,1,1)} 

$(X)  -  2  for  X  «  {(2,2,0), (2,2,1), (2, 2, 2). 

(3, 2,0), (3, 2,1), (3, 2, 2), 

(3, 3,0), (3, 3,1), (3, 3, 2)}  .  U 

2.2.3.  Mir  Paths  and  Min  Cuts 

In  the  binary  case,  a  minimum  path  set  is  a  list  of  components. 

If  every  component  in  a  minimum  path  set  functions,  then  so  does  the 
system.  In  the  multistate  case,  however,  path  sets  have  to  be 
associated  with  the  appropriate  system  state,  and  each  component  must 
have  a  specified  minimum  state.  Thus,  in  the  multistate  case,  a  minimum 
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path  set  Is  a  list  of  minimum  requirements  for  the  state  of  each 


component. 

Definition:  Let  $(X)  be  a  MSF.  X  is  a  path  vector  at  level  m  if 

$(X)  ^  m.  It  is  a  path  vector  at  maximal  level  m  if  $(X)  =*  ra.  If, 

in  addition,  $(Y)  <  m  whenever  Y  <  X,  then  X  is  called  a  minimum 

path  vector  at  maximal  level  ra.  Let  m  *  (m  ,...,m  )  be  a  vector  such 
. .  —  ■  —  —  1  n 

that  if  X  >  m,  then  $(^()  m  and  if  Jf  <  m,  then  $(_Y)  <  m.  The 

vector  m  will  be  called  a  min  path  (minimum  path  set  at  system 

level  m) .  The  jth  min  path  at  system  level  m  will  be  denoted 

m^  ■  (nrj , . . .  ,m^  )  .  There  are  s  min  paths  for  each  system  level 
—  In  m 

(s  when  the  system  level  is  clear).  Also, 


pj(X> 


n 

1  »  n  1  ; 

{ x  >  rn>  i-1  {X  >  } 


is  the  jth  minimum  path  structure  function  at  level  m. 


Definition:  Let  $(X)  be  a  MSF.  X  is  a  cut  vector  at  level  tn  if 

$(X)  <  m.  It  is  a  cut  vector  at  minimal  level  m  if  $(X)  »  m-1.  If, 

in  addition,  $(Y)  >_  m  whenever  Y  >  X,  then  X^  is  called  a  minimum  cut 

vector  at  minimal  level  m.  Let  m  -  (m, , .  •  •  ,m  )  be  a  vector  such 
- —  l  n 

that  if  X  j<  m,  then  $(^)  <  m  and  if  Y  >  m,  then  $(Y)  m.  The 

vector  m  will  be  called  a  min  cut  (minimum  cut  set  at  system 

level  m) .  The  jth  min  cut  at  system  level  m  will  be  denoted 

m^  «■  (ro^,...,m^).  There  are  t  min  cuts  for  each  system 
—  in  m 

level  m  (t  when  the  system  level  is  clear).  Also, 
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<)<» 


1  {X  i  } 


-  1 


n 

n  1  4 

i-l  {X  <  m^  } 


Is  the  jth  minimum  cut  structure  function  at  level  m. 


Example  2.6:  Let  ♦(X^X^X^)  be  as  In  Example  2.5. 

Min  paths  at  level  l  ■  {(1 ,0,0) ,(0,1 ,0) ,(0,0,1)  } 
Min  cuts  at  level  1  *  {(0,0,0)} 

Min  paths  at  level  2  *  {(2 , 2 ,0)  , (2  ,0, 2)  , (0 , 2 ,2)  } 
Min  cuts  at  level  2  -  {( 3 , 1 , 1 ) , ( 1 , 3 , 1 ) , ( 1 , 1 , 3) } 

Min  paths  at  level  3  *  {(3,3,3)} 

Min  cuts  at  level  3  -  {( 3 , 3 , 2) , ( 3 ,2 , 3) , (2 , 3 , 3) } 


Looking  at  the  min  paths  and  min  cuts,  it  is  easy  to  see  that  <t>  is 
parallel  at  level  l,  2-out-of-3  at  level  2,  and  series  at  level  3.  0 


Theorem  2.3: 


(i) 

(il) 


s  t 

<KX)  >  m  11  P4(X)  -  1  <»*  11  <4(X)  =*  1. 
-  -  j=l  3  -  j-1  J  “ 

s  t 

h®  =  P(  11  pj  (X)  -  1)  =  P(  11  <j  (X)  =  1) 


Proof : 

s 

(1)  11  p. (X)  -  i  means  that  p.(X)  -  1  for  some  j. 

j-1  J  ~  J  " 

Thus,  X  2.  3^  for  80ne  J  80  $(X)  >  m  by  definition.  Now  assume 
$(X)  >  m.  X  is  then  a  path  vector  at  level  m,  and  a  min  path  can  be 
constructed  such  that  X  >  J,  This  implies  that  Pj(X )  -  1 
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since  (p.,1  =  l,2,...,s  }  contains  all  min  paths  at  system  level  ra. 
r  j  J  m 

The  proof  is  similar  for  min  cuts. 

(ii)  Part  (ii)  follows  by  taking  expected  values  in  (i).  !j 

For  independent  components,  it  is  true  that  H(pj(X)  »  i) 
s  i 

-  Ii  j  P(X^  ^nr).  However,  it  is  not  necessarily  true  that 
PCJLj^  p^(X)  =  O  P(p^(X)  =  l).  This  situation  arises  because 

independence  among  components  does  not  imply  independence  among  path 
sets  since  two  path  sets  can  have  a  component  in  common.  The  same 
remarks  apply  to  cut  sets.  This  problem  is  illustrated  in  Example  2.7. 

Example  2.7:  Consider  a  ternary  system  composed  of  two  ternary 
components  with  two  path  sets  at  level  2. 

(2 1  -  2,  2*  -  l)  ,  (2 J  -1,2*-  2) 

-  (1/4, 1/4, 1/2)  ,  V2  =  (1/3, 1/3, 1/3) 

2 

P(  Ii  p.(X)  =  1)  =*  P(X,  =2  and  X.  >  1  or  X,  >  1  and  X.  =  2) 
j=  1  J  -  1  l  ~  1  ~  l 

=  P(X2  >  1)  p12  +  P(X2  -  2)  pu  +  0  •  p1Q  -  3/12 

2 

11  P( Pj  (X)  =  1)  -  l-ll-P(X1  =  2,X2  >  1)]  •  1 1-P(XJ  >1,  X2  =■  2)] 

-  1/2.  □ 

2.2.4.  Duals 

The  dual  of  a  binary  coherent  structure  function  is  useful  in 
reliability  modelling  since  the  minimum  path  sets  of  $  are  the  minimum 
cut  sets  of  and  vice  versa.  Thus,  solving  for  the  reliability  of 


the  dual  system  is  equivalent  to  solving  for  the  unreliability  of  the 
original  system.  A  similar  relationship  exists  in  the  multistate  case. 
This  is  mathematical  justification  for  computing  system  reliability 
using  block  diagrams  and  fault  trees  which  model  either  system  success 
or  failure. 

Definition :  Let  $  be  a  MSF.  Its  dual,  is  defined  by 


D 

*  (X)  = 

M  -  <KN~X). 

Theorem 

2.4:  ( <{>D)D  »  4>. 

Proof : 

(<t>D)D(X)  *  M-$U(N-X)  -  M- 1 M- 4>(JN-( N-X ) )  ; 

1  =  <KX).  □ 

Theorem 

2.5:  (Griffith  [1980]):  The  dual  of 

a  MSF  is  a  MSF  and 

possesses  the  same  type  of  coherence  as  the  original  structure  function. 

Proof :  $D(0)  =  M-<KN-0)  *  M-M  *  0. 

$D(N)  =  M-  <t>(  N-N )  =  M, 

If  X  >  Y,  <J>D(X)  =  M-iKN-X)  >  =  $D( Y ) .  This  proves  that  if 

$  is  a  MSF,  then  <J>D  is  a  MSF.  Now  assume  4>  is  coherent.  Let  If 
be  such  that  <t>(  (N±-j  )  ± )  <  <*( (N^j  +  l )  ±  , Y ) ,  and  let  X  -  N-Y. 

A(j-Oi,X)  -  M  -  %((j-l)t,X))  =  M  -  <t»((N.-j  +  l)1,Y) 

<  M  -  ♦((H1-j)1,Y)  =  M-<KN-(j1,X>)  =  Aj^X). 

The  previous  inequality  shows  that  $  is  coherent.  If  $  is  weak 
coherent,  let  Y  be  such  that  $(0^,Y)  <  (KN^.Y),  and  use  the  same 
argument.  If  $  is  strong  coherent,  let  _Y  be  such  that  (N^-j  >^,1?) 
-  N^-j  and  ♦((N^-JO^.X )  i  N  -j  v  J?jAj ,  and  use  the  same  argument.  D 
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Theorem  2.6:  Let  X  be  a  path  (cat)  vector  at  level  m  for  a  MSF  0. 
Then  ^-X  Is  a  cut  (path)  vector  for  0D  at  level  M-m+1.  Further¬ 
more,  if  m  is  a  min  path  (cut)  at  maximum  (minimum)  level  m  for  0, 
then  N-m  is  a  rain  cut  (path)  at  minimum  (maximum)  level  M-m+i  for 


Proof :  If  X  is  a  path  vector  at  level  m  for  0,  0(X)  ,>  m.  Thus, 
0D(N-X)  =  M-0(X)  <  M-m  <  M-m+1,  and  J^-X  is  a  cut  vector  for  0^  at 
level  M-m+1.  Now  let  m  be  a  min  path  for  0.  Then  0(m)  =  m  and 
v  <  m,  4>( Y^)  <  m-1.  Thus, 

D 

0  (N— m)  —  M-0(m)  =  M— o  <C  M— m+1,  and 

<J>°( N-Y )  =  M-0(Y)  ^  M-m+1  v  Y  <  m  or  N-Y  >  Nim. 

T) 

Thus,  fl-m  is  a  min  cut  for  0  .  The  results  in  parentheses  hold  by 
considering  0°  as  the  original  MSF  and  remembering  that  (0D)D  *  0.  0 

Corollary  2.7:  If  0  is  a  k-out-of-n  system  at  level  m,  0°  is  a 
(M-k+l)~out-of-n  system  at  level  M-m+1. 

Proof:  Since  a  k-out-of-n  system  is  uniquely  determined  by  its  min 

paths  or  min  cuts,  this  is  immediate  from  Theorem  2.6.  □ 

Ex^ple  2.7:  Let  0  have  3  components  with  4  states  each  as  in  Example 
2.5  -  parallel  at  level  1,  2-out-of-3  at  level  2,  and  series  at  level 
3.  Then  0^(X)  «*  0(X) ,  i.e.,  the  system  is  self-dual.  It  is  easy  to 
see  that  Theorem  2.5  and  Corollary  7,7  are  satisfied.  □ 

Example  2,8:  Let  0  oe  a  series  -vstera  composed  of  3  ternary 
components. 
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0D(0,0,0)  =  0 


4>°<  1,1,1)  -  Ao.1,1)  =  *D(0,0,i)  =  1 

4>D(2 ,2,2)  =  ♦“(1,2,2)  =  <t>°(  1,1,2)  =  <t>D(0,2,2) 

-  4>D(0, 1 ,2)  =  4>D(0,0,2)  =  2. 

Thus,  is  a  parallel  MSF.  □ 

2.2.5  Modules 

A  module  is  essentially  an  assembly  of  components  which  can 
itself  be  treated  as  a  component.  Modules  are  useful  for  breaking  up 
large  system  into  several  smaller  ones  which  may  be  more  readily 
analyzed.  They  can  also  be  used  to  determine  bounds  on  system 
reliability  which  are  as  good  as  or  better  than  bounds  obtained  by 
considering  the  original  system.  The  concept  of  a  module  is  easily 
generalized  to  the  multistate  case.  Let  (C,>  )  denote  a  set  of  compo 
nents  C  and  a  MSF 

Definition:  (A,x)  is  called  a  module  of  ( C ,  d>)  if  A  c  C  and 

AC  AC 

$()C)  »  4>(x(x  ),  X  )  where  4)  is  a  MSF.  Note  that  the 

number  of  system  states  for  x  must  be  equal  to  the  number  of  states 

for  the  first  component  of  <1. 

Definition :  A  modular  decomposition  of  the  system  (C,4>)  is  a  set  of 

disjoint  modules  {(A^ , x^) » • • • .(A^, Xf) )  together  with  an  organizing 
struc ture  ()>  such  that 

r 

(1)  C  »  u  A  and  A  nA«{$}*i  and  j,  and 

4- I  1  1  J 
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(2)  <KX)  =  4-Cx^X  ) . X,.(X  ))  . 

Parts  (i)  ,  (ii),  and  (ill)  of  Theorem  2.8  are  due  to  Griffith  [1980]. 
Theorem  2.8:  Let  (A,x)  be  a  module  of  (C,$),  i.e., 


<KX)  =  (Kx(XA).XA  ). 


(i) 

If 

X 

and 

are 

both 

coherent,  $  is  coherent. 

(ii) 

If 

X 

and 

are 

both 

strong  coherent,  $>  is  strong  coherent. 

(iii) 

If 

X 

and 

4- 

are 

both 

weak  coherent,  d>  is  not  necessarily 

weak  coherent. 

(iv)  If  4;  is  coherent  and  x  is  weak  coherent,  $  is  weak 
coherent. 

Proof :  Consider  a  component  i.  If  i  t  A,  the  theorem  is  obvious, 
so  assume  i  e  A. 

(i)  Since  x  is  coherent,  there  exists  X  such  that 

xUj-D^)  <  x(jitxA).  Since  (V  is  coherent,  there  exists 

AC 

such  that 

A  AC 

<K(j-0.,x)  =  <K x((j-Di»2i  ),x  ) 

<  <Kx<ji,2CA),xA  )  -  <K j i . x)  . 

(il)  Since  x  is  strong  coherent,  there  exists  such  that 

x(jrxA>  =  j  and  x(A1,xa)  I4  j  *  *  i  j .  Since  4>  is 
coherent,  there  exists  XA  such  that 
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j  ,  and 


(ill) 


(iv) 


Theorem 
with  Xj 


♦(Jj.x)  =  <Kx(ji,xA),xA  )  -  )  = 

♦(^.x)  =  <Kx(\.xS,XA  )  +  j  *  *  *  j  . 

The  problem  here  is  that  there  exists  XA  such  that 

A  A  A^ 

XtN^.X  )  >  X(0  X  ),  and  there  exists  X  such  that 

AC  AC  A 

)  >  <KOt,X  ),  bat  x(Nt.X  )  is  not 

necessarily  equal  to  M  .  This  problem  exists  even  if  x  is 

coherent  or  strong  coherent.  As  an  example,  let  x(Xj,X2) 

=  <KXl  ,Xj  be  ternary  systems  with  ternary  components. 

<K0,0)  =  <1/(1 , 0)  =  0 

4.(0, 1)  =  4.(1, 1)  =  4<(0, 2)  =  4.0,2)  =  1 

4.(2, 0)  =  4.(2,!)  -  4.(2, 2)  =  2  . 

Note  that  states  0  and  1  of  component  1  are  indistinguish¬ 
able.  Let  ^(Xj.X^.X^)  *  4.(x(X|  ,X^)  ,X^)  .  Enumeration  shows 
that  4(Xl,02,X3)  =  4>(X1,22,X3)  v  x3  and  X3.  Thus,  compo¬ 
nent  2  is  irrelevant,  so  $  is  not  weak  coherent. 


Since  x  is  weak  coherent,  there  exists 


xA 


such  that 


A  A 

X(0,)C  )  <  x(N^  X  ).  Since  4.  is  coherent,  there  exists 
AC 

X‘  such  that 

C  C 

4>(0i,X)  -  4.(X(01,XA),  X*  )  <  4-(x(Ni,XA),XA  ) 

=  ♦(N±,X)  .  0 


2.8  clearly  applies  to  a  modular  decomposition  by  replacing  x 
,  •  •  • ,  X  • 
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Example  2.9:  Consider  <KXj  .X^.X^.X^)  where  everything  is  ternary,  X^ 
and  X^  are  In  parallel,  X^  and  X^  are  in  parallel,  and  X^  -  X^ 
are  in  series  with  X^  -  X^.  This  system  is  shown  in  Figure  2.1. 


Figure  2.1  Block  Diagram 

It  is  easy  to  see  that 

^VWV  3  mln  (X1  V  X2’X3  V  V 

=  min  (xJ(Xl,X2),  x2(X3,X4))  =  (Kx^XpX^,  xU^)) 

where  x  is  a  parallel  structure  function  and  ^  is  a  series  structure 
function. 

As  a  sample  calculation, 

♦(2,0,1, 1)  -  <K  x(  2 , 0)  ,  X(l.l))  -  4>(  2,1)  -  l.  □ 

The  next  theorem  shows  that  the  dual  of  a  module  is  a  module  in  the 
dual. 
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Theorem  2,( 
$D(X)  -  4>Di 

Proof : 


>:  If 


:  xD(xA) ,xA 


is  a  MSF  and 

). 


(A,x)  is  a  module  of  (C 


AxVS,XA  )  -  4>D(Mx-X(/-XA)  ,XA  ) 

c  c 

-  M  -  (KxO^-xV^  -XA  ) 

,  M  -  $(N-X)  -  <fD(X)  .0 


1 


,  ♦),  then 
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3.  STOCHASTIC  RESULTS 


Although  the  definition  of  coherence  is  important  in  engineering 
considerations,  most  of  the  stochastic  results  for  the  binary  case  can 
be  extended  to  the  multistate  case  assuming  only  that  the  structure 
function  is  nondecreasing  in  each  component.  Many  of  the  multistate 
analogues  to  binary  results  exist  in  the  literature  although  theorem 
hypotheses  are  generally  stronger  than  necessary.  These  theorems  have 
been  extended  to  the  general  multistate  model  described  in  Chapter  2 
with  hypotheses  weakened  when  appropriate.  The  proofs  of  these  theorems 
usually  require  only  minor  modifications  to  the  proofs  given  for  more 
restrictive  situations.  Some  results  which  had  not  previously  been 
extended  to  the  multistate  case  also  appear  in  this  chapter. 

Results  pertaining  to  system  utility  are  contained  in  Section  3.1. 
These  are  important  because  reliability  is  thought  of  as  the  probability 
that  the  system  is  operational.  Since  the  system  can  be  partially 
operational  in  the  multistate  case,  expected  system  utility  is  a  better 
measure  of  system  performance  than  is  reliability.  In  Section  3.2  the 
relative  importance  of  each  component  to  the  system  is  considered.  Many 
of  the  reliability  importance  measures  in  this  section  had  not  been 
previously  extended  to  the  multistate  case.  It  is  known  that  certain 
classes  of  life  distributions  are  closed  under  convolutions,  mixtures, 
and  the  formation  of  coherent  systems.  By  extending  the  idea  of  life 
distribution  classes  to  life  distribution  processes,  these  closure 
theorems  are  shown  to  apply  to  the  multistate  case  in  Section  3.3.  The 
results  for  closure  under  the  formation  of  coherent  systems  previously 
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existed  in  the  literature,  but  the  results  for  closure  under 
convolutions  and  mixtures  are  new.  Reliability  bounds  on  both  system 
and  component  reliability  are  considered  in  Section  3.4.  Butler  1 1978] 
extended  most  of  the  bounds  on  system  reliability  to  the  multistate  case 
using  the  probability  that  the  system  is  at  a  certain  level  or  higher  as 
a  measure  of  reliability.  In  Section  3.4.2  these  bounds  are  shown  to 
apply  to  expected  system  utility.  In  addition,  bounds  on  the  expected 
utility  of  non-coherent  systems  are  obtained.  Bounds  on  component 
reliability  are  usually  based  on  properties  of  the  life  distribution 
class  to  which  the  component  belongs.  Using  life  distribution  proces¬ 
ses,  these  bounds  are  applied  to  multistate  components  in  Section  3.4.3. 

3.1.  System  Utility 

In  Section  2.1,  the  reliability  of  the  system  at  level  k  was 
1c  k 

defined  as  h  =  P($(X)  k).  Note  that  h  i  E<KX)  as  it  was  in  the 

binary  case.  In  the  multistate  setting,  E<KX)  =  P(  <t>(X)  k)  « 

M  k.  k 

^•k=l  h  ‘  However,  neither  h  nor  E4>(X)  is  necessarily  the  best 
measure  of  system  performance.  It  is  possible  that  system  states  j 
and  j+L  have  nearly  identical  utility  while  there  is  a  large 
difference  in  the  utility  of  states  k  and  k+L.  The  measure  of  system 
performance  that  will  be  used  in  this  thesis  is  expected  system  utility. 

Let  U(*)  be  a  utility  function  which  assigns  value  to  system 

state  k,  k  -  l,  2,  ....  M  where  a^  =  0  without  loss  of  generality. 

««»>  ■  j,  Vuop-H-  X  bk‘(*(x)>k) 

where  \  -  V  "  Vi  ■ 
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M  M 

EU(  <K  X) )  =  l  a.  P(  <K  X)  *  k)  -  I  b.h  . 
k-1  k  k-1 


When  *  k  k,  EU(  $00 )  -  E$(X).  When  a.  =  0  -v  j  <  k-1  and 

k 

-  1  *  j  k,  EU($(X))  ■  h  .  Thus,  expected  utility  includes 
expected  value  and  reliability  as  special  cases.  If  $(X)  is  weak 
coherent,  0  =  a0£ai£  •••  <_  an°*  \  2  0  ^  k.  If  $(X)  is 

coherent,  0=a0<a1<***<aM  and  >  0  ■v  k. 


Theorem  3.1: 


a . 

EU(  4>(  X)  )  =  I1  p..EU($(j  ,X)) 
j=0  1J  1 


Proof : 


M  N 

EU(  $(_X) )  -  l  bk  r  p  P($(X)  >  k|x  -  j) 
k-1  j=0  J 


N. 


M 


f  I  b  P(*(j  ,X)  >  k) 

j=0  k-1 


N 

I 

j-0 


r  Pij  EU(<.(j1,X))  .  D 


Using  Theorem  3.1,  the  following  pivotal  decomposition  can  be  derived 
where  the  first  equality  holds  only  for  independent  components. 

N  N  M 

euu(x))  -  x  •••  r  pH  •••  pn1  i  \  i(*(i  a  )>k> 

j  -0  j  -0  ljl  nJn  j-0  k  mjl . 

J  l  n 


I  P(X  -  x)  l  b  1 


j-0 


k  {$(x)>k}  * 
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r 

0 

Theorem  3.2:  If  |  Is  a  coherent  MSF,  then 

(1)  EU($(X  v  Y))  >  [EU($(X))1  v  [EU($(Y))1  where  equality  holds 
if  and  only  if  $  is  a  parallel  structure  function. 

(ii)  EU(<KX  ay))  <  IEU(4>(X))]  a  [EU($(Y))]  where  equality  holds 
if  and  only  if  $  is  a  series  structure  function. 

Proof:  (i)  From  the  pivotal  decomposition, 

P($(X  v  Y)  >  k)  -  P($(X)  >  k)  v  P($(Y)  >  k) 

x  y  ^ 1  ^  1  (4>(x)  v<Ky  )>k}^  P^-  — ^ 

From  Theorem  2.2,  the  quantity  in  brackets  is  always  nonnegative  and  is 
0  if  and  only  if  <p  is  a  parallel  structure  function.  The  result 
follows  by  multiplying  by  b^  >  0  and  summing  over  k. 

(ii)  The  proof  is  similar.  Q 

The  next  result  shows  that  expected  system  utility  is  increasing  in 
q^  (recall  q ^  =  P(X^_>  j).  If  $  is  coherent,  then  the  expected 
system  utility  is  strictly  increasing  in  q^ . 


Theorem  3. 

_3: 

Let 

x(xo 

have 

distribution  q(q.' )  • 

(i) 

If 

3  < 

<^'  and 

$ 

is  weak  coherent,  EU($(X)) 

EU($(X')). 

(ii) 

If 

a.  < 

q '  and 

$ 

is  coherent,  EU($(X>)  <  EU(<p(X')). 

Note:  £  < 

:  a' 

is 

the  same 

as 

saying  X  is  stochastically 

smaller  than 

X'  or  X 

1st 

X*. 

Proof :  (i)  Since  £  <.  3.'  and  $(X)  is  increasing  in  X» 


1 


P($(X)  >  k)  <  P($(X')  >  k)  *  k. 


The  result  follows  by  multiplying  by  0  and  summing  over  k. 

(ii)  Since  ^  <  £’ ,  P(4>(X)  >  k)  <  P(  <J>C  X'  )  >  k). 

Inequality  must  hold  for  some  k  since  $  is  coherent,  and  the  result 
follows  by  multiplying  by  b^  >  0  and  summing  over  k.  To  see  that 
coherence  implies  inequality  for  some  k,  assume  ^  and  q^'  differ 
only  in  the  ith  row. 


i 

PU(X)>k)=  p..PU(j  .  ,X)  >  k) 
j  =  0  1J 


where 


t  >  k)  -  P(<K(j-l)  ,X)  >  k)  J 

j=o 


P((K-11,X)  >  k)  H  0 


N. 

l  qti  [PC  4>Cj  ±  ,x* )  >  k)  -  P(4>((j-I)i,X')  >  k)] 


j  =  0 
N 

l 

j=0 

by  coherence 
=  P(KX')  >  k)  .  0 


<  r  qJ.lPCKL.X’)  >  k)  -  P(K(j-l).,X’ )  >  k)l 

_  n  Ij  J-  ■** 


Theorem  3.3  is  a  special  case  of  the  economic  theory  of  stochastic 
dominance  and  utility  functions.  See,  for  example,  Hadar  and  Russell 
[1969]. 


3.2.  Reliability  Importance 

When  deciding  whether  or  how  to  improve  a  system,  it  is  useful  to 
know  where  a  given  improvement  would  do  the  most  good.  Obviously  a 
budget  constraint  is  necessary,  but  a  good  heuristic  approach  to 
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Improving  system  reliability  is  to  start  with  the  component  that  yields 

the  greatest  increase  in  system  reliability  for  a  given  improvement  in 

component  reliability.  Several  measures  of  component  importance  have 

been  suggested.  A  measure  for  which  component  probability  distributions 

are  not  necessary  is  called  structural  importance  while  reliability 

importance  is  the  term  given  to  a  measure  which  involves  those 

distributions.  The  first  definitions  of  structural  and  reliability 

importance  were  proposed  by  Birnbaum  {1969].  These  measures  and  their 

attributes  have  been  extended  to  the  multistate  case  by  Butler  [1979] 

and  Griffith  (1980]  in  different  ways.  The  theorems  and  proofs  in  those 

papers  have  been  slightly  modified  to  fit  the  more  general  multistate 

case  presented  in  this  thesis.  Other  importance  measures  are  due  to 

Barlow  and  Proschan  [1975b]  and  Fussell  [1976],  These  measures  and 

their  attributes  are  herein  extended  to  the  multistate  case.  For  a 

complete  discussion  of  binary  reliability  Importance,  see  Lambert 

[1975].  Throughout  this  section,  it  is  assumed  that  the  components  are 

lc  k 

independent  so  that  h  =  h  (P)  and  upgrading  one  component  will  not 
affect  other  components. 

Definition:  The  Birnbaum  Reliability  Importance  of  component  i  in  a 
binary  system,  denoted  is  defined  by 

Ih(i,P>  h(llvP)  -  h(0i,P) 
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Definition:  (Butler  (1979]).  The  r,s  reliability  importance  of  compo¬ 
nent  1  at  level  m  is  defined  by 

l^'8  (i.P)  -  hm(ri,P)  -  hm(Si,P)  -  P(*(X)  >  m|xt  -  r) 

where  0£s<r£Ni. 


Theorem  3.4:  (Butler  [1979]).  Let  +  6^  +  •••  +5  =*  0,  and 

let  0  <  _<  1  -v  k,j.  Define  P  by 

.  K  1£  l'k 

P  .  .  =  / 

( +  6j  l£  1  ■ k  • 

Then, 

h(p)  -  h(p>  +  ^  ihK  (k,p>  +  •••  +  Sji^V.p)  . 
k 

Proof : 


h(P)  +  6, 


A*° 


N, 


(k.P)  + 


+  6^’°  (k.P) 


-  Xjio  Pkjh<VI>  +  6NkIh(Nk.P)-h(Ok,P)]  +  •••  +  61(h(lk,P)-h(0k,P)] 

N  N 

^j-1  (pkj  +  5j)h^k*P)  +  pk0 ^ °k ^  “  h^°k’-^  Ij-1  6j 

-  Ij.o  <PkJ  +  «j)WJk.P)  -  h(Ok.P)  I^0  ts 

■  ■  *,  pkj«Jk-i>  ■  h<i> 
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where  the  last  line  follows  since  P  and  P  differ  only  in  the 


kth  row.  Q 

A  AC 

Theorem  3.5:  Let  4>(X)  =  <Kx(X  ) ,  X  ),  let  i  £  A.  and  let  x  be 
a  binary  MSF.  Then 


I 


(i) 


(1)1 


r  ,s 


(i)  • 


Proof : 

C  C 

Vcrt> "  V‘i’2*  )Vri-iA)  +  >' 

c  c 

h^.P)  =  h^d,/  )hx(Si,XA)  +  h(|)(0j  >XA  )ll-hx(s..XA)] 

Ih’S(i>  =  ~  V8i’I> 

<1> 

c  c 

=  [  h^(  1 1  ,  XA  )  -  )][hx(r.,XA)  -  hx(Si,XA)] 

=  lJ/°(L)  ij’fi)  •  □ 

^  X 

Definition :  X  is  a  critical  vector  for  binary  component  i  if 

4>(1  ,X)  -  4>(0  ,X)  *  1.  Let  nfi)  be  the  number  of  critical  vectors 
i  —  i  — 

for  component  i.  The  Birnbaum  Structural  Importance  of  component  i 
is 


IJi) 


—  I  NO,  .X)  -  «0,  ’*>1 

2  X 
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Definition:  (Butler  [1979]).  X  is  an  r,s  critical  vector  at  level 


m 


for  component  1  if  >  m  and  (J>( s ^ ,  X)  <  m,  or 

l(<Kri,X)>m}  '  1{4.(s.,X)>m}  “  1  * 

r  s 

Let  n  ’  (i)  be  the  number  of  r,s  critical  vectors  at  level  m 
for  component  i.  The  r,s  structural  Importance  at  level  m  for 
component  i  is 


I^,S(i)  i  [ (N+1)(N+1)  •••  (N  +1)]  1  nr’S(i)  . 
$  12  n  $ 

Theorem  3.6:  (Butler  [1979]). 


(i) 

N  O  N  N  -1 

lh  (i)  *  lh  (i)  +  •••  + 

lj’°(i) 

(ii) 

N.,0  N. ,N  -1 

V  =  1/  U>  +  + 

lJ,,U(i) 

4> 

(iii) 

I^,S(i,[l/N])  =  lJ’S(i)  where 

[  1  /  N  ]  means 

Pij  = 

l/o^+l)  V  i,j 

Proof : 

(i) 

N  .0 

\  (i)  -  h(NltP)  -  h(0itP) 

=  [h(NifP)  -  h((Nl-l)i,P)]  +  •••  +  [h(li,P)-h(Oi,P)] 

W1  1  0 

-  IhX  1  (i)  +  •••  +  lJ*U(i) 
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(11)  %  (1)  “  l  ^U(Ni#X)  >  m)}  "  1{0(01,X)  >  m)  ^ 

=  X  >  m>  ~  l<*((Ni-l)i,X)  >  n>}^ 

+  *  “  +  ^{♦(ll,X)  >  m}  '  1{KOi,_X)  >  m)  ^ 


N  N  -1 

n/  (i)  +  •' 


1,0,  .  s 

n*  (i) 


Multiplying  by  [ ( N  +1)  •••  (N  +1)]  *  yields  the  result, 
1  n 


(ill)  h(  j  t ,  [  1/N]  )  = 


xes,x  =*j 


P(0(jl(x)  >  m)P(X  =  x) 


=  [(NI+I)»*,(N1_1+l)(Ni+1+ !)••*(  N^+l)  ] 


-1 


x«?VJ  If*(J1,i)  >  .} 


-  I  (N |  +  1 )  •••  (»1|+1>l'1  Js'(,(j1,x)  >  .} 

i'’5d. !!/»])  -  i(N1+o  ...  (Vi)r'  i  [i{  ,  >m) 

xes  i  —  — 

1  {(Ks^  ,x )  >  m }  ^ 

=  [  (N  +1)  •••  (N  +1)]_1  n*’S(i)  =  •  3 

1  n  4>  4> 


A  different  extension  of  the  Birnbaum  reliability  importance  is  due 

to  Griffith  (1980],  In  the  binary  case,  h(P)  =  h(0.,P)  +  p  I  (i,P). 

—  i  —  11  n  ~ 

Thus,  knowing  I(i,P),  it  is  easy  to  determine  the  increase  in  system 
h  — 

reliability  caused  by  upgrading  a  component.  There  is  a  vector  in  the 
multistate  case  which  is  analogous  to  I  (i,P). 
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Notation : 


Theo  rem  3.7: 

+  UD  * 

transpose)  . 


V” 


=  PCOU^X)  >  j)  -  PUUA-l^.X)  >  j) 


V4) 


I(i) 


1  W1’ 


=  (id),  •••,  i  (i» 
I  N± 


(Griffith  [1980]).  EU(«KX))=  tfO^X)  >  j) 

where  (^  =  ^qil ,qi2’ “  ' ,qiN  (t  denotes 


Proof: 

PUd^X)  >  j)pu 

-  j^qi*-qi,  Jl+1^ 

where  q  *  1  and  q.  N  =0 
1U  1,wi+l 

Ni 

=  P(4>(0.,X)  >  j)  +  1  iP(<K^»P  >  j) 

1  ~  1=1 

-P(«(H)i,X)  >  j)]qiA 


P(*(X)  >  j) 


t 

1=0 


N 

r 

1=0 
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M 

EUU(X))  =  I  b.  P(KX)  >  j) 
j-1  J 


M  N. 


I  b .  P($<0.  ,X)  >  j)  +  I  f  b .  [P($(* ,  ,X)  >  j) 
j-1  J  j-1  *=1  J 

-  P($((A-Di,x)  >  j)]qu 


M  N 

I  b  P(  4>(0  ,  X)  >  j)  +  I  I  ?( i)q  . 
j  =  l  J  1  X=1  X 


M 

=  v  b  P(<D(0  ,X)  >  j)  +  I(i)  *  q ,r  .3 
j=l  J 


Corollary  3.8:  If  component  1  Is  stochastically  improved  from 
distribution  q ^  to  q^  _>q.£>  the  change  in  expected  utility  is 


AEUU(X))  =  I(i)  •  A  where  A  -  q^  -  q  v  i, j  >  1 


Theorem  3.9:  (Griffith  [1980]).  If  a^  =  j  ,  so  that  EU(<KX)>  =  E  «J>(  X)  , 
then 


1(1)  =  C  E  <J)(  1  A  ,_X)  -  E<K01,X) . E<{>(Ni,X)  -  E$((Ni-l)i,X)>  . 


Several  other  importance  measures  are  due  to  Barlow  and  Proschan 
[1975b].  Their  raultistate  extensions  are  defined  herein. 

Definition:  The  B-P  (binary)  reliability  importance  of  component  i  is 


I__U>  =  J  [h(  1  ,F(  t)  )  -  h(0  ,F(  t)  )  ]dF  ( t) 

BP  0  i  -  1  -  1 

*  probability  that  component  i  causes  system  failure. 
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A  time  dependent  version  of  this  measure  is 

IBp(i,t)  i  (l-h(F(t))]"1  jj  {h(li,F(t))-h(Oi,F(t))]dFi(t) 
t 

/  [h(l  F(t))  -  h(0  ,F(t))]dF  (t) 

0  1  11 
a  — . - - —  ■ .  —  -  - .  ■-  —  — 

l  /  (h(l  F(t))-h(0  ,F(t)))dF  <t) 
i=  l  0  X 

=*  P  (component  i  causes  system  failure  )  system  is 
failed  by  time  t) 

The  second  equality  above  follows  from  the  fact  that  if  the  system  is 
failed  by  time  t,  one  of  its  components  must  have  been  the  source  of 
that  failure. 

Definition:  The  B-P  multistate  reliability  Importance  at  level  m  of 
component  i  is 

N  -l  - 

1“  (i)  =  I1  /  (hm( (k+l) . ,£(  t) )-hm(k  ,F( t) ) JdFyX  t)  . 
k=0  0  1  1 

Recall  that  F^(t)  -  P(X1( t )  <  k)  . 

Each  term  in  the  above  sum  is  the  probability  that  a  transition  of 

component  i  from  state  k+I  to  k  caused  the  system  state  to  drop 

below  tn.  InD(i)  is  the  probability  that  some  transition  of  compo- 
or 

nent  i  caused  the  system  state  to  drop  below  m.  Letting  P  -  F(t), 
we  can  relate  the  two  multistate  reliability  importance  measures  via 

“1 1  /  lJ;+l,k(i,F(t))dF5-(t)  . 

k-0  0  "  1 
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The  time  dependent  multistate  version  of  this  measure  is 


I« „(i.t)  =  [ l-hm(F( t) ) ) _1  ^  J  [hm((k+l)  F(t))-hm(k  F(t))]dFk(t) 

k=0  0  l 

N  -1  t 

I1  /  [hm((k+l)  F(t))-hm(k. ,F(t))]dF^(t) 


n  n  -l  t 

I  I  I  [hm((k+l>  F(t))-hm(k.  ,F(t))ldF.k(t) 

t  _  i  i _ n  n  *>  1 


N  -1  t 

I  / 

k-0  0 

n  N  -1  t 

t  f  I 

i= 1  k=0  0 

P  (component  i  caused  the  system  state  to  be 
<  m  |  system  state  at  time  t  is  <  m) 


Note  that  =  !gp(i)  • 

A  AC 

Theorem  3.10:  If  $(X)  =  iKx(X  )  ,X  ),  i  e  A,  and  x  is  a 


binary  MSF,  then 


(i)  l“p(i)  -  I1  1  /  1>  I^+1,k 


k=0  0  (p 


U)dF^(t) 


(ii)  l“  (M)  =  £  i”  (i)  where  M  is  the  module  x  of  $ 

BP  i€A  BP 


Proof : 


(i)  Letting  £  =  £(t),  this  is  an  immediate  corollary  to  Theorem 
3.5. 

(ii)  Since  failure  of  the  module  at  time  t  must  be  caused  by  a 
transition  of  one  of  its  components  at  time  t, 
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r 

t 


1 


dF  (t)  -  l  £*  \h  ((k+l>  X*)  -  h(k  XA)]dFk(t) 

M  ieA  k-0  X  1  “  X  1  ~  1 

where  M  means  module 

N  -1  L.  t 

-  I  I1  Ik+1*k(i)dFk(t) 

IcA  k=0  X 

^p(H)  -  J  lJ,-WH(t) 

0  4> 

=  /  l!*°(M)  l  ll  Ik+1,k(i)dFk(t) 

0  n<ti  icA  k»0  nx 

=  l  Igp(i)  using  part  (i)  .  Q 

ieA 


The  same  idea  can  be  applied  to  determine  the  reliability  importance 
of  a  cut  set  instead  of  a  single  component. 


Definition:  The  B-P  (binary)  cut  set  importance  of  cut  set  is 

<*>  K.  -  {i  } 

IRP0O  =  l  J  h(  1  , 0  d  ,  F(t)>  n  F  (t)dF  (t) 

J  icfi.  0  1  JUK  -  {i  >  *  1 

J  j 

“  P  (failure  of  cut  set  K  causes  system  failure) 

Definition:  The  B-P  multistate  cut  set  importance  of  cut  set 
(level  m)  is 


n  • 


i”  (mj+l)  =  1  j  hm((iJ)  Fm(t))[  n  F“(t)]dF"(t) 

i-1  0  Wi 


where  F^(t)  ■  P(X^(t)  <  m^)  and 


^((iJ+Dj.^t))  -  P(*(X(t))  >  mlX^t)  -  mj+l.X^t)  <  Sj  v  t  +  i) 
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Note  that  component  A  may  be  unimportant  In  a  particular  cut  set  In 
which  case 


and  F^(t)  “  1  v  t 

Definition:  The  B-P  (binary)  structural  importance  of  component  i  is 

CO 

I  (i)  =  J  [h(l  ,F(t))-h(0  F(t))]dt 

s  o  1  1 

=  /"[h(l  ,P)  -  h(0  ,P)]dP 
0 

where  F^(t)  =  Fk(t)  =  P  v  j,k  . 

The  B-P  structural  importance  of  component  i  is  derived  by  letting 
all  components  have  the  same  life  distribution  and  averaging  over  that 
distribution.  This  importance  measure  is  more  difficult  to  extend  to 
the  multistate  case  since  the  life  of  a  component  is  not  well  defined. 
One  possible  extension  is  contained  in  the  following  definition. 

Definition :  Let  all  components  and  the  system  have  state  space 
{0,l,...,M}.  The  B-P  multistate  structural  importance  at  level  m  of 
component  i  is 


l"(i)  =  I1  I  /  [hm(k+l)  FX(t))-hm(k  F\t)JdF*(t) 

J  k=0  A-0  0 

where  F*(t)  -  P(Xj(t)  <  A)  -  P(Xk(t)  <  A)  «  j,k  . 

An  Importance  measure  which  is  used  in  fault  tree  analysis  because 
it  is  easy  to  calculate  is  described  in  Fussell  [1976]. 
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Definition:  Let  1— h^CjF )  =  P(4>^(X(t))  «  0)  -  P  (all  components  in  a 
cut  set  containing  i  are  failed  at  time  t) 

♦J(X(t))  s  5*  U  XjCO 

j  =  l  AcK 

i 

where  is  the  number  of  cut  sets  containing  component  i. 

The  V-F  (binary)  reliability  importance  of  component  i  is 

1^(1)  =  1 1-h^F)  )/ 1  l-h(F)  } 

=  P  (component  i  is  contributing  to  system 
failure  |  system  failure  by  time  t) 


Definition:  Let 


l-h*,m(F) 


P(^,m(t) 


1) 


*  P(X^(t)  v  k  in  a  cat  set  at  system 

level  m  for  which  mi  =  A) 

N*’m 


«*f’“(t>  E  if  S  1 

j-1  k-1  {^(t)^} 


where  N*,m  =  the  number  of  cut  sets  at  system  level  m 
for  which  m^  »  A 
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The  V-F  multistate  reliability  Importance  at  system  level  m  of  compo¬ 
nent  1  at  level  l  is 

ljp(i>  *  [l-hJ'm(F)]/[l-hm(F)]  . 

Definition :  The  V-F  (binary)  reliability  importance  of  cut  set  K.  is 

UCK  )  =  n  F  (t)/[  l-h(F)) 

W  J  icK  1 

=  P  (cut  set  K.  is  contributing  to 

system  failure  at  time  t  |  system 
failure  by  time  t) 

Definition ;  The  V-F  multistate  reliability  importance  of  cut  set  nr' 
is 

.  n 

)  =  n  F,  l(t)/[l-hm(F>]  . 

i-1 

The  V-F  importance  measures  are  all  concerned  with  the  probability 
that  a  component  is  contributing  to  system  failure.  Another  considera¬ 
tion  is  the  probability  that  a  component  is  critical  to  system  failure, 
i.e.,  that  repairing  that  component  will  allow  the  system  to  function. 

Definition:  The  criticality  Importance  of  component  i  is 

I  (i)  s  [h(l  ,F)  -  h(0  ,F)jF(t)/(l-h(F)J 
c  1  —  i  —  i  — 

■  P  (component  i  is  failed  at  time  t 

and  repairing  it  will  restore  the 

system  |  system  failure  by  time  t) 
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Definition:  The  r,s  criticality  Importance  at  system  level  m  of 


component  1  is 


I^’S(1)  H  [hm(r1,F)-hm(si,F)lF®(t)/[l-hm(F)] 

»  P(Xi(t)  £  s,  and  X^Ct)  r  would  make 
<KX(t))  >  m  |  4>(X(t)  <  m)  . 


3.3.  Closure  Theorems 

Several  categories  of  life  distributions  were  defined  in  Section 
1.1.  An  obvious  way  to  extend  these  categories  to  the  multistate  case 
is  to  replace  the  component  life  distribution  F^Ct)  by  F.(t) 

■  P(Xi(t)  k) .  The  following  definition  is  slightly  more  general. 

Definition:  Let  (X^t),  t  0}  be  a  real  valued  stochastic  process, 

and  let  T^  =  inf  {t  :  X^(t)  <  a}  be  the  hitting  time  of  a.  X^t) 
is  said  to  be  an  I FRA  (IFR,  DFRA,  DFR,  NBU,  NUU ,  NBllE ,  NWUE)  stochastic 
process  if  T^  is  an  IFRA  (IFR,  etc.)  random  variable  v  a.  The 
process  X^(t)  will  usually  represent  the  state  of  the  ith  component. 
This  definition  allows  X^(t)  to  be  a  continuous  random  variable. 

There  are  several  reliability  operations  for  which  closure  theorems 
have  been  developed  in  the  binary  case.  One  type  of  reliability 
operation  which  has  been  previously  discussed  is  the  formation  of  a 
coherent  system  from  several  components.  A  second  is  convolution  which 
yields  the  distribution  for  a  sum  of  random  variables.  This  is  used  to 
get  the  distribution  for  the  sum  of  the  life  lengths  of  a  component  and 
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several  spares.  Another  reliability  operation  is  mixture  of  distribu¬ 


tions  which  is  useful  when  a  component  may  have  one  of  several  life 
distributions.  This  could  occur  if  several  manufacturers  produce  the 
same  type  of  component,  and  it  is  not  known  which  manufacturer  produced 
the  operating  component.  A  closure  theorem  for  one  of  these  reliability 
operations  states  that  if  the  input  distributions  all  belong  to  a 
certain  category,  then  the  output  distribution  also  belongs  to  that 
category.  The  following  table  shows  which  categories  are  preserved 
under  each  of  the  three  aforementioned  reliability  operations. 


Reliab 

.lity  Operation 

Life  Dis- 

Formation 

Convolu  tion 

Arbitrary 

Mixtures  of  Dis- 

tribution 

of  Coherent 

of 

Mixtures  of 

tributions  that 

Class 

Systems 

Distributions 

Distributions 

do  not  cross 

IFR 

NP 

P 

NP 

NP 

IFRA 

P 

P 

NP 

NP 

DFR 

NP 

NP 

P 

P 

DFRA 

NP 

NP 

P 

P 

NBU 

P 

P 

NP 

NP 

NBUE 

NP 

P 

NP 

NP 

NWU 

NP 

NP 

NP 

P 

NWUE 

NP 

NP 

? 

P 

P  ■  preserved  NP  »  not  preserved  ?  =  unknown 


TABLE  3.1  Closure  Theorems 

The  proofs  of  the  results  in  Table  3.1  may  be  found  in  Barlow  and 
Proschan  [1975a].  The  binary  counter-examples  to  preservation  of  life 
distribution  classes  presented  therein  obviously  apply  to  the  multi- 
state  case.  It  will  be  shown  that  all  of  the  closure  theorems  which 
hold  in  the  binary  case  have  analogues  in  the  multistate  case.  Not 
surprisingly,  the  main  technique  used  to  prove  the  multistate  closure 


theorems  Is  the  reduction  of  the  multistate  case  to  the  binary  case. 

The  following  theorem  is  due  to  Ross  (1979). 

Theorem  3.11  ( IFRA  Closure):  Let  (X,(t),t  >  0)  be  independent  IFRA 
stochastic  processes,  and  let  4>  be  a  MSF.  Then  {$(^(0),  t  ^  0}  is 

an  IKRA  stochastic  process. 

In  this  theorem,  $  does  not  need  to  be  a  multistate  function.  It 
could  be  continuous,  negative,  etc.  ,  -  just  as  long  as  it  is  increasing 
in  its  arguments.  This  is  true  for  all  the  theorems  in  this  section. 

The  next  theorem  is  due'to  El-Newe ihi,  Proschan,  and  Sethuraman  [1978]. 

Theorem  3. 12  (NBU  Closure):  Let  (X^(t),  t  ^  0)  be  independent  NBU 

stochastic,  processes,  and  let  4>  be  a  MSF.  Then  {<KX(t)),  t  0}  is 

a  NBU  stochastic  process. 

Definition :  Let  T  and  be  random  variables  with  distributions 

Fj  and  F^.  T^  +  T^  has  distribution  F(t)  *  F^  ( t-xJdF^Cx) 
which  is  called  the  convolution  of  F^  and  F^  and  is  denoted  F^  *  F^. 
In  reliability  T^  and  T^  are  usually  the  life  lengths  of  components. 

Theorem  3. 13:  Let  {X^(t),  t  _>  0}  and  {X  (t),  t  >_  0}  be  independent 

IFR  (IFRA,  NBU,  NBUE)  stochastic  processes  corresponding  to  two 

components.  Let  k  and  l  be  the  respective  states  for  which 

k  1 

components  1  and  2  are  considered  failed.  Then  T  =  +  T^  is  an 

IFR  (IFRA,  NBU,  NBUE)  random  variable. 

Proof :  Since  X^(t)  and  X^Ct)  are  IFR  stochastic  processes,  Tj  and 
l 

T2  are  IFR  random  variables.  The  result  is  then  immediate  from  the 
binary  case  (see  Theorem  4.2  of  Barlow  and  Proschan  [1975a]).  3 
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Theorem  3.13  could  be  considered  as  the  multistate  extension  of  the 


closure  theorem  for  convolutions,  but  it  is  really  the  same  theorem  with 
state  0  =  {0,1,..., k}  and  state  1  =  {k+l,...,M}  for  component  1  and 
a  similar  equivalence  for  component  2.  Theorem  3.15  contains  more  of 
the  flavor  of  an  operating  component  and  spares.  First,  the  following 
lemma  is  needed. 

Lemma  3.14:  Let  {X^(t),t  0 }  be  a  stochastic  process  corresponding 

to  the  state  of  the  ith  component  for  which  the  only  allowable  state 

transitions  are  from  a  state  to  the  next  lower  state.  Denote  the 

sojourn  time  in  the  kth  state  by  T,  ,,  ,  .  If  the  T,  ,  ,  are 

k  k- 1  k , k- 1 

independent  IFR  (IFRA,  NBU,  NBUE)  random  variables  v  k  =  1,  ....  , 

then  X^(t)  is  an  IFR  (IFRA,  NBU,  NBUE)  stochastic  process. 

Proof;  T*  3  Inf  (t  ;  X^t)  <  k)  -  *  —  ♦  Tk,,  k 

is  an  IFR  random  variable  since  it  is  a  sum  of  IFR  random  variables. 

Thus,  X^( t)  is  on  IFR  stochastic  process.  □ 

Now  consider  a  process  X(t)  as  the  maximum  of  several  processes 
Xj,  ...,  X^  which  represent  components.  Only  one  component,  the  one 
which  is  currently  in  the  largest  state,  is  in  operation.  Thus, 
transitions  between  the  states  of  components  not  currently  in  service 
cannot  occur,  and  the  only  allowable  transition  is  to  the  next  lower 
state  of  the  operating  component. 

Theorem  3.15;  If  transitions  T^  ^  ^  are  independent  IFR  (IFRA,  NBU, 
NBUE)  random  variables,  then  the  X(t)  process  described  above  is  an 
IFR  (IFRA,  NBU,  NBUE)  stochastic  process. 
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Proof : 


1 

T*  i  inf  {t  :  X(t)  <  k}  -  +  •••  +  Tk. 

—  1  n 

By  Lemma  3.1  A,  T  are  all  independent  IFR  random  variables.  Since 
T  is  the  sum  of  independent  IFR  random  variables,  it  is  an  IFR  random 
variable.  □ 

Definition:  Let  {X  }  be  a  set  of  random  variables  whose  index  a  is 

-  a 

a  random  variable  with  distribution  G.  The  mixture  of  X  is  a  random 

- a 

variable  with  distribution 

CD 

F(x)  -  /  F  (x)dG(a)  . 
a 

—CD 

The  F^  will  usually  represent  component  life  distributions,  and  a 
will  usually  have  a  finite  range  so  that 

n 

F(x)  -  l  F  (x)P(a  =  i) 
i=*l 

Theorem  3.16:  Let  (X^Ct),  t  ^  0}  be  independent  DFR  (DFRA) 
stochastic  processes  indexed  by  a  random  variable  a  with  distribution 
G.  Let  X(t)  be  the  mixture  of  Xff(t).  Then  X(t)  is  a  DFR  (DFRA) 
stochastic  process. 

Proof:  Let  Ta  =  inf  {t  :  X  (t)  <  a),  and  let  Fa  be  the  distri- 
-  a  a  —  a 

button  of  T^.  The  distribution  of  T3  =  inf  {t  :  X(t)  a)  is 
Fa(t)  «  /_<BFa(  t)dG(  at) .  Since  each  Ta  is  a  DFR  random  variable, 

g 

T  is  a  DFR  random  variable  from  the  binary  DFR  closure  theorem  for 
mixtures  (see  Theorem  4.3  of  Barlow  and  Proschan  [1975a]).  □ 


Theorem  3.17:  Let  {X  (t),t  >  0}  be  independent  NWU  (NWUE)  stochastic 
-  a  — 

processes  such  that  there  is  no  crossing  of  and  on  (0,®) 

*  a,  a,  a'.  Let  a  be  a  random  variable  with  distribution  G,  and 

let  X(t)  be  the  mixture  of  X  (t).  Then  X(t)  is  a  NWU  (NWUE) 

a 

stochastic  process. 

Proof :  Same  as  the  preceding  theorem  using  the  binary  NWU  closure 

theorem  for  mixtures  of  distributions  that  do  not  cross  (see  Theorem  5.7 
of  Barlow  and  Proschan  [ 1975a] 1.  D 

3 .  A .  Bounds 

It  may  often  be  difficult  or  time-consuming  to  compute  exact 
system  reliability,  especially  for  large  systems.  Thus,  it  can  be 
useful  to  have  upper  and  lower  reliability  bounds.  If  the  lower  bound 
is  sufficiently  large,  no  further  calculations  will  be  necessary  to 
satisfy  reliability  requirements.  If  the  upper  bound  is  too  low,  this 
may  be  an  indication  of  problems  in  the  system  design.  In  the 
multistate  case,  system  reliability  is  the  probability  that  the  system 
structure  function  meets  or  exceeds  a  certain  level.  Several  upper  and 
lower  bounds  on  system  reliability  are  given  in  Section  3.4.1.  These 
include  the  use  of  modular  decomposition  which  helps  make  the 
computations  feasibLe  for  large  systems  and  often  leads  to  improved 
bounds.  Bounds  on  expected  system  utility  are  discussed  in  Section 
3.4.2.  These  are  simple  extensions  of  the  bounds  in  Section  3.4.1  and 
are  useful  since  expected  system  utility  is  a  better  measure  of 
performance  than  reliability  for  multistate  systems.  Bounds  exploiting 
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the  properties  of  various  life  distributions  are  contained  in  Section 
3.4.3.  Some  of  these  bounds  are  formulated  specifically  for  multistate 
components,  and  some  are  for  multistate  systems. 

3.4.1.  System  Reliability  Bounds 

lc 

The  bounds  in  this  section  pertain  to  h  =  P(4>(X)  k).  The 

first  paper  to  consider  multistate  reliability  bounds  was  Postelnicu 
[1970].  The  results  from  this  paper  have  been  slightly  modified  and  are 
contained  in  Theorem  3.18. 

Theorem  3.18:  Let  u  (X)  =  <t>(X.,0),  and  let  v  (X)  =  <KX.,l),  Let 
- i—  i—  i  —  l  ~ 

F  1(k)  =  P(u  (X)  >  k),  and  let  F  i(k)  =  P(  v.(X)  >  k) .  If 

<t>  is  a  MSF  with  independent  components,  then 

(i)  P(p  (X)  >  k)  <  hk(P)  <  P(v.(X)  >  k)  v  i  and  k 

p  p  p  v  _v 

(ii)  F  1  *  F  2  *  •••  *  F  n(nk)  <  ti(P)  <  F  1  *  • • •*  F  °(nk) 

n  ,  n 
(iii)  ii  P(u  (X)  >  k)  <  hK(P)  <  n  P(v.(X)>k)  . 

i-1  1  1=1  1 

Proof : 

(i)  n  (X)  =  KX.,0)  <  <KX)  <  4>(X.,J_)  =  V.(X)  *  i. 

The  result  follows  immediately. 

(ii)  Summing  on  i  and  dividing  by  n  in  the  equation  from 
part  (i)  yields 

p  (X)+**H-p  (X)  v  (X)+**H-v  (X) 

“ - <  *00  <  - - -  • 

The  result  follows  from  properties  of  convolutions. 
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(ill)  Since  maxi  u(X),...,p(X)]  <  (J>(X )  <  min[  v  (X) , . . . , v  (X) ] , 

L—  n  —  —  —  —  1  —  n_ 

PCmaxln^X) . Pn(X)]  >  k)  <  *»k(P)  <  P(ninl  v  (X ),...,  v^X)  ]  >  k) . 

The  result  follows  from  the  independence  of  the  X^.  G 

Bounds  on  multistate  system  reliability  were  also  obtained  by  Barlow 
and  Wu  [1978],  El-Neweihi,  Proschan,  and  Sethuraman  [1978],  and  Hatoyama 
[1978].  These  bounds  and  many  others  are  contained  in  Butler  [1982]. 

The  rest  of  the  theorems  in  this  section,  except  Theorem  3.19,  come  from 
Butler's  paper,  and  any  proofs  which  are  omitted  may  be  found  therein. 

The  following  definition  may  also  be  found  in  Section  1.1  and  describes 
components  which  have  similar  rather  than  independent  behavior.  This 
might  be  useful  to  describe  components  in  a  common  environment. 

Definition:  Random  variables  X,,  ...,  X  are  associated  if 

COV(F(X),  A(X))  >0  *  pairs  of  increasing  binary  functions 

r  and  A  (assuming  the  covariance  exists). 
Associated  random  variables  have  the  following  properties.  Proofs  may 
be  found  in  Esary,  Proschan,  and  Walkup  [1967]  or  Barlow  and  Proschan 
[1975a] . 

(1)  Subsets  of  associated  random  variables  are  associated. 

(2)  Independent  random  variables  are  associated. 

(3)  Increasing  functions  of  associa  ed  random  variables  are 
associated . 

(4)  If  Xj . X^  are  associated  random  variables,  then 

V  X1 . Xn 


-65- 


P(X,  >  X, . X  >  X  )  >  n  P(X,  >  xj,  and 

l  1  n  n~i«l  * 


P(X,  <  x . X  <  x  )  >  n  P(XJ  <  x,). 

1—1  n  —  n  —  l  —  i 


Theorem  3.19  (Series  -  Parallel  Bounds):  Let  $  be  a  MSF  for  which 

min  (X.)  <  <K  X)  <  max  (X.),  and  let  X.,  i  =  1,  n  be  associated, 

i  i  —  —  —  i  i  i 

Then 

n  p(x  ^ m)  <  b”  <  ^  p(x  ^  m>  • 

i- 1  i=  1  1 


Proof:  Since  the  XJ  are  associated,  l  f„  v  *.  are  associated  by 
-  i  1 X . >m  > 

l— 

property  (3).  Since  min  (X.)  <  4>(X)  <  max  (X.), 

i  1  “  ~  ~  i  1 


n  n 

~  lU(X)>m}  -  ~j  1  (Xx  >m } 


Taking  expectations  and  using  property  (4)  yields 


n  P(X,  >  m)  <  P(X,  >  m,  X  >  m) 

.  ,  i  —  —  1  —  n  — 

i-  1 


<  ^  <  E[  .a 


l  -  P(X,  <  m,...,X  <  m) 

1  n 


<  1  -  n  P(X  <  m)«  H  P(X  >  m).0 
i-1  1  i-1  i 
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Theorem  3.20  (Path-Cut  Bounds):  Let  i  be  a  MSF  with  associated 


components  X^,  X^.  Then 

t  s 

n  p(<  (x)  -  i)  <  hm  <  11  p(p  (x)  -  i)  . 
j-i  J  J-i  3 

Proof :  KjCX),  <t(X)  and  pj(X)»  Pg(X)  are  increasing  func¬ 

tions  of  J(  and  thus  associated  by  property  (3).  Using  Theorem  2.3, 
the  fact  that  xr^  and  p^  are  binary,  and  property  (4)  of  association 
yields 


hm  -  p(  n  *c .  ( x) 
J-l  J" 


l)  -  P(k  (X)  >  0 . 1C  (X)  >  0) 


>  n  p(<.(x)  -  i)  . 

j“i  J 


hm  =  p(  11  p.(X)  -  1)  -  1  -  P(P.(X)«) . .  (XKO) 

i-l  1 


<  i  -  n  p(p  (x)<o>  -  11  p(p  (x)  -  i)  .  □ 

i-l  1  i-l 


Corollary  3.21  (Path-cut  bounds  for  independent  components):  If  the 
Xt  in  the  preceding  theorem  are  independent,  then 


t  n  8  n 

n  .11  P(X  >  ra})  <  hm(P)  <  -11  n  P(X  >  nrj) 

j-l  i-l  1  1  j-l  i-l 
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Proof :  From  independence 


1 

P(p.(X)  =  1)  **  n  P(X  >.mp,  and 
J  1=1 

n  n 

P(k  (X)  =  1)  =  1  -  n  P(X  <  w3 )  =  II  P(X1  >  mj)  .  0 
J  i= 1  i*  1 

Note  that  equality  does  not  necessarily  hold  in  Corollary  3.21  as  shown 
by  Example  2.7. 

Theorem  3.22  (Max-min  bounds):  Let  (j)  be  a  MSF .  Then 

max  PCp.(X)  =  1)  <  hm  <  min  P( <  (X )  =  i) 
j  J  -  -  j  J 

Proof :  From  Theorem  2.3,  p^(X)  <  l{^(X)>m}  1  v  i.j* 

The  result  follows  by  taking  expectations,  maximizing  over  j,  and 

minimizing  over  i.  0 

Corollary  3.23  (Max-min  bounds  for  associated  components):  If  4>  is  a 
MSF  and  X^,  ...,  X^  are  associated,  then 

n  n 

max  II  P(X.  >  mj.)  <  h®  <  min  J-l  P(X  >  m^ )  . 

j  i-1  1  1  j  i-1 

Lemma  3.24  (Bonferroni  Inequalities):  The  following  equations  are  true 
for  arbitrary  events  E^,  ...,  E^. 


n  n-1  n 

(ii)  P(E  UE  o  ...  UE  )  >  l  P(E  )  -  l  l  P(E  n  E,  )  . 

j-1  3  j-1  k-j+1  3  k 

Theorem  3.25  (Bonferroni  Bounds):  Let  $  be  a  MSF. 

(1)  hm  <  P(p(X)  =  1). 

(ii)  hm  >  l*ml  P(p  (X)  -  1) 

-  £i  S-j+i  p<  <*>  ■  v»  ■  »• 

(iii)  hm  >  1  -  Jjt_1  P(ie  (X)  -  0). 

(iv)  hm  <  1  -  Ej.j  P(ie  (X)  -  0) 

+Ipl  Xj_j+1  P<^(X)  -  <k(X)  -  0). 

Proof :  Apply  the  Bonferroni  inequalities  to 

hm  -  P({pl(X)  -  1}  U  ...  u  (Ps(X)  -  1»  -  1  -  P(0>(X)  <  m) 

-  1  -  P({<l(x)  =  0)  u  ..*  u  {k  (X)  -  0>)  .  □ 

Note  that  the  idea  in  Theorem  3.25  could  be  extended  by  adding  more 
terms  in  the  Bonferroni  inequalities,  e.g. , 

P(E  u  ...  UE  )  <  l  P(E  )  -  n~l  ]  P(E  nE  ) 

n  j-1  J  j-1  k-j  +  1  3  * 

n-2  n-1  n 

+  l  l  l  P(E  «E  UE  )  . 

j-1  k-j  +  1  Jt-k+1  3 
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Corollary  3.26  (Bonferroni  bounds  for  Independent  components) :  If  # 


Is  a  MSF  and  X  ,  ....  X  are  independent,  then 


s  n 

(i)  hm(P)  <  l  n  P(X  >mj)  . 

j-1  i=l 

s  n  ,  s-I  s  n 

(H)  hm(P)  >  l  n  P(X.  >m])~  l  l  n  P(X,  >  max 

j=l  i=l  j=l  k=j+I  i=l  1  ~  11 

t  n 

(iii)  hm(P)  >  I  -  l  n  P(X  <  ). 

j=l  i=l  1  1 

t  n 

(iv)  hm(P)  <  I  -  l  n  P(X  <  mJ.) 

j-1  1*1  1 

t-1  t  n 

+  l  l  n  PCX,  <  min  (nrj ,  ra^)). 

j=l  k=j  +  l  i-1  11 


Example  3.1:  Let  (KXj  ,X2 .X^)  be  parallel  at  level  l,  2-out-of-3  at 
level  2,  and  series  at  level  3.  Let  _P  =  (  .  1 ,  .  1 ,  .4  , .  4)  , 

^2  "  (  .2, .2, .2, .4)  ,  and  _P  =  ( . I , .2 , .2 , . 5) .  Calculations  of  exact 
system  reliability  are  shown  below. 


P(<KX)  >  1)  =  XL  P(X  >  1)  =  .998 
i=  l  1 

p(0(X)  >  2)  =  P(Xt  >  2,  X2  >  2,  or  X  >  2,  X3  >  2,  or 
x2  >  2,  x3  >  2)  =  .788 
3 

P(<KX)  -  3)  -  II  p  -  .08  . 

1=1  iJ 
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Table  3.2  contains  several  upper  and  lower  bounds  for  this  system. 

Notice  that  several  of  the  bounds  are  exact  at  level  1  and  level  3. 

This  is  reasonable  since  series  and  parallel  systems  are  very  simple 
systems.  The  best  bounds  at  level  2  are  the  path-cut  lower  bound  (.761) 
and  Bonferroni  upper  bound  (.812).  Bonferroni  bounds  may  be  outside  the 
unit  interval  and  may  not  be  monotonic  in  the  system  level.  0 


Up  per 

Theorem 

Series- 

Path- 

Max- 

Bonferroni 

Bonferroni 

Ac  tual 

Bounds  at 

3.19C 

Parallel 

Cut 

Min 

(i)and( ii) 

( i ii)and(  iv) 

Value 

Level  1 

1 

.998 

.998 

2.6 

.998 

.998 

Level  2 

1 

.976 

.867 

1 .46 

.812 

.788 

Level  3 

• 

o 

00 

.82 

.08 

.08 

.26 

.08 

Lower 

Bounds  at 

Leve 1  l 

.998 

.576 

.998 

.35 

.998 

.998 

Level  2 

0 

.336 

.761 

.452 

.74 

.788 

Level  3 

0 

.08 

• 

o 

00 

00 

o 

• 

.08 

-.7 

.08 

Table  3.2  Bounds 

For  large  systems,  exact  reliability  calculations  or  even 
computation  of  the  bounds  in  this  section  may  be  difficult.  By 
exploiting  modular  decomposition,  this  computational  effort  can  be 
greatly  decreased.  A  second  benefit  of  modular  decompositions  is  that 
the  resulting  reliability  bounds  are  at  least  as  good  as  the  bounds 
obtained  by  considering  the  whole  system.  The  obvious  way  to  precede  is 
to  establish  upper  and  lower  bounds  on  the  reliability  of  a  module  at 
each  level  and  then  use  those  bounds  as  if  they  were  the  actual 
probabilities  in  the  organizing  structure. 
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Notation;  Let  $  be  a  MSF  composed  of  independent  components  with 
modular  decomposition  $(X)  -  •••.  Xf(A  )). 


i  f  i"l  -i 

DAq)  =  n  XL  P(X  >  *J.) 

♦  J-l  i-1  1  1 

1%)  =  max  (D^S.)) 
v  m<KM  v 


.  s  a 

E*(q)  =  11  n  P(X  >  *j)) 

*  j-l  i-1 


u"(^)  5  min  ( E q )  ) 
^  (X*<m  ♦  " 


VfL>  =  (L°^ . 


V<L)  =  (h°(P) . hJ(P)) 


%(£>  5  O^S.) . ^(q)  ) 


HY(a.)  =  H  (q)  L  (q)  =  L  (q)  U  (q)  =  U  (q) 


Hy  (q) 

*r 


Ly  <S-> 


Ux  (q) 
r 


It  can  be  shown  that  D®(q)  4  L™(q)  and  that  E™(a)  ^”(0 
I  l 

in  general.  D^(a)  and  E^(a)  are  path-cut  lower  and  upper 

bounds  at  level  i  for  independent  components.  Note  that  H^,  L^, 

and  U  are  matrices  with  r  rows  and  M+l  columns. 

X 

Theorem  3.27 ;  Let  $  be  a  MSF  composed  of  independent  components  with 
modular  decomposition  $(X)  **  <KXj(A  ),  ....  x^A^)). 

UHtaJ) 

(i)  V*>  “  W*”  >  {Hl(L:(q)>  »  >  VLX(i>)  *  v*> 


u,(Hv(a)) 

(ii)  H.(a)  -  H,<H  (a»  <  {  *  ^  }  <  U.(U  (*))  <  u  Ca) 


Vux<a>) 


Note  that  the  equations  in  Theorem  3.27  show  that  the  exact  system 
reliability  calculated  by  using  modular  decomposition  is  the  same  as 
that  obtained  by  considering  the  whole  system.  This  must  be  true  from 
the  definition  of  a  module.  Also  note  that  the  path-cut  bounds  obtained 
by  modular  decomposition  are  better  than  the  original  path-cut  bounds, 
L^(t^)  and  U^(£).  Butler  (1982]  has  an  example  of  the  calculations 
involved  for  each  of  the  quantities  in  Theorem  3.27. 

3. A. 2.  System  Utility  Bounds 

The  bounds  in  Section  3.A.1  were  bounds  on  the  probability  that 

the  system  meets  or  exceeds  a  certain  level.  A  useful  facet  of 

multistate  systems  is  the  allowance  for  partial  operation.  For  example 

the  system  states  might  be  the  percent  of  rated  power  generated  at  a 

coal  plant.  In  this  case  there  is  no  minimum  level  which  the  system 

must  meet  or  exceed,  but  there  are  often  requirements  on  the  long  run 

average  capacity.  Letting  states  0,  1,  ....  100  be  the  percent  of 

rated  power  generated,  a  lower  bound  on  kP(  $(X)*k)  is 

„M 

desired.  More  generally,  bounds  on  )  a  P(<KX)  =  k),  where  a 

k= Ok—  k 

is  the  utility  of  state  k,  would  be  useful. 


Theorem  3.28:  Let  <f>  be  a  coherent  MSF.  Let 
state  k  with  a^  =  0,  and  let  b^  =  a^  -  a^ 
Let  1^<  P(<t>(X)  )  k)  (  n.  Then, 

M  M 


be  the  utility  of 
for  k  =  1 ,  . . . ,  M . 


I  b.L  <  EU(*(X))  <  l  b  U  . 
k-1  K  k-1  *  K 

Proof :  Multiplying  L^  P($(X)  by  b^  _>  0,  summing  over 

and  using  EU(  $()( ) )  -  ^  P(  $(X)  k)  yields  the  result.  □ 


k, 
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=  max(L^,... ,l£) 

and  »  min( , . . . ,U^)  where  and  are  the  lower  and 

upper  bounds  obtained  in  Section  3.4.  L. 

Example  3.2:  Consider  the  system  from  Example  3.L  with  a^  =  k 
(or  bk  »  1) . 

EU(4>(X))  =  .998  +  .788  +  .08  =  1.866  . 

The  best  upper  and  lower  bounds  in  Table  3.2  are: 

U  =  L  =  .998 

U2  =  .812,  L2  -  .761 

03  =  L3  =  .08 

3  3 

bL  =  1.829  and 
k=l  *  K  k=l 

Theorem  3.28  becomes  1.829  £  1.866  £  1.88.  Both  upper  and  lower  bounds 
are  with  2%  of  the  actual  value.  Q 

It  is  possible  for  a  raultistate  system  to  be  non-coherent.  For 
example,  if  the  state  of  the  system  is  temperature,  there  may  bo  a  range 
of  temperature  which  is  acceptable  while  both  high  and  low  temperatures 
are  unacceptable.  In  this  case  system  reliability  might  be  defined  as 
P(  ♦(  X)  £  k)  -  P(  <K  X)  £  A)  where  A  -  1  is  the  upper  temperature  limit 
and  k  is  the  lower  temperature  limit.  This  could  be  accomplished  by 
setting  a^  =  0  for  j  <  k  or  j  £  A  and  a^  =  1  for  j  -  k,  k  +  1, 

A  -  1.  Theorem  3.29  provides  for  bounds  on  non-coherent  systems. 


I  b.U.  =  1.88 


The  best  way  to  use  Theorem  3.28  is  to  set 


•  •  •  * 


Theorem  3.29: 


Let  |  be  a  MSF,  and  let  <  P(  $(X)  _>  k)  <  -v  k. 

Let  K+»{k:b^>^0},  and  K  -  {k  :  <  0}  where  =  a^  -  ak_j. 

Then 

l  \\  +  l  bkUk  <  EU(  ♦()())  <  I  bkUk  +  l  bkLk  . 

keK+  keK  kcK+  kcK 

Proof :  For  k  e  K+,  b^  <  bk  P(4>(X)  >  k)  <  bkUk  . 

For  k  «  K~,  bkLk  >  bfc  P($(X)  >  k)  >  b  1)  . 

Summing  on  k  yields  the  result.  Q 

Theorem  3.28  is  obviously  a  special  case  of  Theorem  3.29  with  K  being 
the  null  set  for  a  coherent  system. 

3.4.3.  Bounds  from  Life  Distribution  Processes 

In  this  section  properties  of  life  distribution  processes  are 
used  to  determine  reliability  bounds.  In  the  first  part  of  this 
section,  it  is  assumed  that  the  underlying  distribution  is  known  to  be 
IFR,  IFRA,  DFR,  or  DFRA  and  that  one  parameter  of  the  distribution, 
e.g. ,  a  moment  or  a  quantile,  is  known.  This  kind  of  analysis  is  useful 
in  determining  bounds  on  the  reliability  of  binary  components.  If  a 
component  is  known  to  have  an  IFR  distribution  due  to  wear-out  or  a  DFR 
distribution  due  to  burn-in,  it  may  be  possible  to  estimate  the  mean 
component  lifetime  and  then  use  these  bounds.  This  analysis  can  also  be 
applied  to  the  multistate  case,  but  it  may  not  be  as  useful  because  of  a 
lack  of  data  for  a  parameter  of  the  distribution  F^(t).  The  upper 
and  lower  bounds  on  component  reliability  can  be  used  to  determine  upper 
and  lower  bounds  on  system  rel'ibility.  In  the  second  part  of  this 
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section,  bounds  for  mean  system  lifetime  are  obtained.  If  it  was  known 
that  h  had  an  appropriate  distribution  with  a  known  parameter,  then 
the  bounds  from  the  first  part  of  this  section  could  be  directly  applied 
to  system  reliability.  This  is  unlikely,  however. 

Theorems  3.30,  3.31,  3.32,  and  3.33  appear  in  Barlow  and  Proschan 
[1973a]  and  are  intended  only  as  a  representative  sample  of  the  many 
bounds  contained  in  reliability  literature. 

Theorem  3.30:  Let  F  be  IFRA  (DFRA)  with  pth  quantile  5^,  i.e., 

F(Cp  «  P). 

(  >  (<)  e~at  for  0  <  t  <  £p 

F(t)  j 

(  1  (>)  e  at  for  t  >  Sp 

where  a  »  -(1/?  )  ln(l-p) 

P 

Theorem  3.31:  Let  F  be  DFR  with  mean  p. 

(  e  for  t  <  p 

F(t)  < 

(  pe  Vt  for  t  >  p 

Theorem  3.32:  Let  F  be  IFRA  with  mean  p. 

/  1  for  t  £  p 

F(t)  < 

I  -wt 

'  e  for  t  p 

— wt 

where  w  >  0  satisfies  1  -  wp  -  e 
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Theorem  3.33:  Let  F  be  a  continuous  IFR  distribution  with  rth  moment 

P  .  r  >_  1. 

/  exp  (~t/\^T)  for  t  < 

F(t)  > 

(  0  for  t  >  uj/r 

where  \  =  p^/IXr+l). 

If  (X^(t),  t  >  0)  is  an  IFR  (IFRA,  etc.)  stochastic  process,  then 
the  distribution  F^(t)  is  IFR  (IFRA,  etc.)  for  any  k.  If  we  know 
the  appropriate  parameter  of  the  distribution  F^( t)  for  any  level 
k,  then  the  appropriate  bound  may  be  applied.  The  bounds  may  also  be 
useful  if  F^( t )  is  IFR  (IFRA,  etc.)  for  a  particular  k  even  though 
X^(t)  is  not  an  IFR  (IFRA,  etc.)  stochastic  process. 

In  the  binary  case  it  is  easy  to  obtain  a  bound  on  system 
reliability  from  the  bounds  on  component  reliability.  For  example  if 
component  1  has  an  IFR  distribution  with  known  mean  and  component  2 
has  an  IFRA  distribution  with  a  known  quantile,  and  the  components  are 
independent,  applying  Theorems  3.30  and  3.33  yields 

-t/K 

h(F  ,  F  )  >  h(e  ,e  )  for  0  <  t  <  min  (  £  ,  p  ) 

IS  —  —  p  l 

To  generalize  this  to  multistate  systems,  a  lower  bound  would  have  to  be 
specified  for  several  states  of  each  component.  Also,  the  bounds  on 
F^(t)  and  F^(t)  wsuld  probably  be  correlated  so  that  the 
calculation  of  h  might  not  be  straightforward.  A  way  to  avoid  this 
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problem  is  to  use  the  component  bounds  together  with  the  bounds  on 
system  reliability  from  Section  3.4.1  as  illustrated  in  Example  3.3. 

Example  3.3:  Consider  where  X^  and  are  independent 

ternary  components.  Let  ♦  be  parallel  at  level  1  and  series  at  leve 

2. 

♦(0,0)  =  0  ♦  (  2 , 2)  =  2 

♦(0,1)  =  ♦d.O)  =  ♦(!,!)  =  ♦(2,0)  =  >(0,2)  =  $>(1,2)  =  *(2,I)  =  1 

At  level  1,  the  min  paths  are  (0,1)  and  (l,u),  and  the  only  min  cut 
is  (0,0). 

At  level  2,  the  only  min  path  is  (2,2),  and  the  min  cuts  are  (1,2) 
and  (2,1). 

Applying  the  max-min  bounds  for  associated  components  (Corollary  3.23) 
yields 


max  f  H  P(XJ  >  1^)  <  h1  <  11  P(X.  >  l1) 

i  —  i  —  —  i  i 

j-1,2  i-l  i=l 


max  (P(X1>1),  P(X2>1))  <  hL<  P(X1>0)  +  P(X2>0)  -  P(X  >0)P(X?>0) 
2  2 

n  P(X  >  2JJ  <  h2  <  min  (  Jl  P(X.  >  2^))  or 
1=1  J-1.2  i-l 

P(X  -2)P(X  -2)  <  h2  <  min  (P(X(=2),  P(X2=2))  . 

Now  assume  that  X^(t)  and  X^t)  are  1FR  stochastic  processes  and 
k  k 

that  F  has  mean  p^.  From  Theorem  3.33, 
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-t /\T  k 

F*(t)  >  e  for  t  <  lij 


Using  this  result  and  the  max-min  bounds  yields 


,  0  .  0 

-t/p,  -t/p 


-«■/  e t  -'■i  1  ,  0  Ov 

(e  ,  e  j  <  hl(F(t))  for  t  <  min  (pltP2) 


max 


-t/n}  -t/pj  2  .  1  1. 

2  e  <  h  ( _F( t )  )  for  t  <  min  (p^Pj) 


Since  Fk(t)  is  IFR,  it  is  also  IFRA,  and  Theorem  3.32  applies  to 
give 


k 

-w ,  t 


F^(t)  <  e  1  for  t  >  where  w^  >  0  satisfies 


—v  t 

k  k  i 

1  -  wip1  =  e 


Using  these  inequalities  in  the  max-min  upper  bounds  yields 


0  0  0  0„ 

-w  t  -w  t  -w  t  -»-*■  0  0 

h  (F( t) )  <  e  +  e  -  e  e  for  t  >  max  (Pj,P2) 

w!t  w^t  j  ! 

h2(F(t))  <  min  (e  ,  e  )  for  t  >  max  ( ^  )  .  □ 


The  technique  in  Example  3.3  can  be  used  to  combine  the  bounds  in 
this  section  and  Section  3.4.1  to  derive  bounds  on  system  reliability. 
The  results  in  Section  3.4.2  could  then  be  applied  to  get  bounds  on 
expected  system  utility.  There  are  also  some  bounds  which  apply 
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directly  to  mean  system  lifetime  In  the  binary  case.  In  the  multistate 
case  these  bounds  apply  to  the  mean  time  that  the  system  spends  above  a 
certain  level.  The  following  definition  will  be  useful  in  this 
development. 

Definition:  Let  F  and  G  be  continuous  distributions,  let  G  be 

strictly  increasing  on  its  support,  and  let  F(0)  =  G(0)  =  0.  F  is 
star-shaped  with  respect  to  G,  written  F  <  G,  if  (1/x)  G  ^(F(x))  is 
increasing  in  x  >  0.  Some  properties  of  this  ordering  are: 

(1)  F  <  G  is  scale  invariant  but  not  translation  invariant. 

* 

—  W 

(2)  Let  G(x)  =  1  -  e  .  Then  F  <  G  is  equivalent  to  F 

-  * 

being  IFRA. 

(3)  If  F  <  G,  then  F(x)  crosses  G(9x)  at  most  once  as  x 
increases  from  0  to  ®  for  each  P  >  0.  If  a  crossing 
occurs,  it  occurs  from  above.  If  F  and  G  have  the  same 
mean,  the  crossing  must  occur. 

Theorems  3.34,  3.35,  and  3.36  apply  to  binary  systems  of  binary 
components  and  may  be  found  in  Barlow  and  Proschan  [1975a]. 

Theorem  3.34:  Let  X,(Y.)  have  distribution  F,(G.)  with  mean  li  , 
-  i  l  i  l  i 

and  let  F  <  G . ,  i  =  1,  2,  ....  n.  Let  X.,  ...,  X  be  associated, 
1*1  In 

let  Vj,  . ..,  Y^  be  associated,  and  let  X^  be  independent  of 

Y  v  i,j.  Then  the  mean  life  of  a  series  system  using  components  with 

lifetimes  Xj ,  .  . . ,  X^  is  greater  than  the  mean  life  of  a  series  system 

using  components  with  lifetimes  Y, ,  ...,  Y  while  the  opposite  is  true 

l  n 

for  a  parallel  system.  More  precisely: 
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E(min  (X . ,X  ))  >  E(min  (Y,,...,Y  ))  and 

In—  In 

E(max  (X . ,X  ))  <  E(max  (Y,,...,Y  ))  . 

In—  In 

Theorem  3.35  is  derived  using  Theorem  3. 34  and  property  (2)  of 
star-shaped  ordering. 

Theorem  3.35:  Let  p^  be  the  mean  life  of  a  series  system,  and  let 
Up  be  the  mean  life  of  a  parallel  system  composed  of  n  associated 
components.  Let  the  ith  component  have  an  IFRA  (DFRA)  marginal 
distribution  with  mean  p^. 

00  n  -t /  p  n 

(i)  H  >  (<)  /  H  e  dt  =  (  l  1/p  T1 

s  OH  i=l 

•  A  “t/H 

(ii)  p  <  (>)  /  11  e  1  dt  . 

p - 0  1-1 

Theorem  3.36:  Let  $  be  a  binary  coherent  system  with  min  cut  sets 
K^,  ...,  and  min  path  sets  ....  Pg.  Let  component  lifetimes 

be  associated,  and  let  the  ith  component  have  an  IFRA  marginal 
distribution  with  mean  p^.  Let  T  be  the  system  lifetime. 

-i  00  -t/H 

max  [  £  1/p  ]  £  ET  <  mi n  /  11  e  dt 

j  i€P.  1  "  j  0  icK. 

J  J  J 

To  apply  these  results  to  the  multistate  case,  let  X^  be  the 
time  that  component  i  spends  at  or  above  state  k,  and  let  the  system 
be  series  (parallel)  at  level  k.  Assuming  that  the  hypotheses  of 
Theorem  3.34  hold  for  X^  and  Y^,  the  mean  time  spent  at  or  above 
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system  level  k  by  a  series  system  at  level  k  using  component 
k  k 

lifetimes  X^,  . . . ,  Xfl  is  greater  than  that  obtained  by  using 
k.  k 

Yj,  ....  Ym.  The  opposite  is  true  for  a  parallel  system  at  level 


k  k 

Theorem  3.35  (Multistate  Version):  Let  p  (p  )  be  the  mean  life  of 
-  s  p 

a  series  (parallel)  system  at  level  k,  and  let  the  ith  component  have 

k 

an  IFRA  (DFRA)  marginal  distribution  at  level  k  with  mean  p^. 


,  ®  n  - 1/  p .  n  , 

(i)  >  (<)  /  n  e  1  dt  mil  1/^  )" 

0  i-1  i=l  1 


(ii)  £  <  (>)  /  A 


e  dt  . 


0  i=l 


Theorem  3.36  (Multistate  version):  Let  <J>  be  a  MSF  with  minimum  cut 

structure  functions  ic^ . at  level  m  and  minimum  path 

structure  functions  p  ,  ...,  p  at  level  m.  Let  the  time  that 

1  s 

component  i  spends  at  or  above  level  j  be  IFRA  with  mean  p|,  and 
let  all  such  times  be  associated.  Let  T111  be  the  time  that  the  system 
spends  at  or  above  level  m.  Then 


max  [  l  l/^]~1<ETm<  min  /  H 
l<j<s  i*l  Kj  <t  0  i=  1 


-t/X  J 
e  1  dt 


1  mi  -i  V 

where  =  pi  and  =  p^ 


By  convention  p^  =  ®,  so  1/p^  “  0. 
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Proof:  E(time  p^XO:))  =  1)  <  ET3  <  E(tlme  tc  (X(t))  *  1)  -v  i,j 


max  E  (time  p  (X(t))  •  1)  <_  ET  min  E(time  <.(X(t))  =*  1) 

j  J  j  J 


The  result  follows  from  the  multlstate  version  of  Theorem  3.35  since 


n  . 

[time  p  (X(t))  =  1)  =  E(time  X  m1^  -v  i)  _>  (  £  1/  k*  )  ,  and 

J  i-1  1 


E(time  ic.(X(t)  =  1)  =  E(time  X^  >  m^  for  any  i) 

-i  °°  i11!  -t/X 

=  E(time  X  >  for  any  i)  <  /  I !  e 


-t/\J 

e  1  dt  .  G 


0  i=  1 


Theorem  3.36  obviously  applies  when  each  component  is  an  independent 
1FRA  stochastic  process.  In  that  case  a  bound  on  expected  total  system 
utility  can  be  derived  assuming  that  the  system  runs  without  repair  for 

0  <  t  <  “>.  Multiplying  the  inequalities  in  Theorem  3.36  by  b  and 

—  —  m 

summing  on  m  yields 


I  bm  max  [  l  1/XJJ  1  <  l  b^T™  <  £  bm  min  J  A 

ra=l  ji=l  m=l  m“l  j  0  i*  1 


Note  that  the  middle  terra  is  total  expected  system  utility. 

A  different  idea  for  applying  Theorem  3.36  is  contained  in  Theorem  3.38. 


Lemma 

3.37 

:  If 

>  •  • 

. ,  X  are  associated,  then 

80  are 

n 

1  t  L  L  1 

and 

Vj 

where 

I  and  J  are  any  subsets 

of  the  indices 

1,  .. 

. ,  n. 

and 

Cj.  and 

dj  are  nonnegative  constants. 
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Proof : 


cov[  l  C  X  l  d  X  ]  =  l  l  C  d  COV[X  X  ]  >  0 
id  jej  J  J  iel  jeJ  J  J 

since  the  X^'s  are  associated.  0 

Theorem  3.38:  Let  transitions  between  the  states  of  a  component  be 

independent  IFRA  random  variables  T  for  0  <  j  <  i  <  M.  Then 

-k  k 

v  k,  there  exists  an  IFRA  random  variable  T  such  that  P(T  >  t) 

>  P(Tk  >  t)  v  t. 

Proof :  Let  W  ,  k  =  L,  ...,  R,  be  the  distinct  paths  which  can  be  taken 
to  reach  or  drop  below  state  k  when  the  state  is  always  larger  than  k 
before  the  last  transition. 


^  =  TM,k1  +  Tklfk2  +  +  \,kA+1  Where  k*  >  k’  4+1  <  k 


T  »  min  (W  ) 
k=l , . . . ,R 


V 

Since  sums  of  independent  IFRA  random  variables  are  IFRA,  the  W  are 

lr 

IFRA.  From  Lemma  3.37,  the  W  are  associated.  Thus, 


P(T  >  T) 


P(min  W 
k 


>  t) 


R 

>  n 

k=  1 


P(Wk  > 


t) 


R 

=  n 

k-l 


P(Wk  >  t)  =  P(Tk  >  t)  . 


where  W  ,  . . . ,  W11  are  independent  random  variables  with  the  same 

1  n  — lr 

distribution  as  W  ,  ...,  W  .  The  distribution  of  T  is  the  same  as 
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•r 


the  distribution  of  a  series  system  of  independent  1FKA  random  variables 

- 1  —  n 

(W  ,  W  )  and  is  thus  IFRA  by  the  multistate  1FRA  closure 

theorem.  0 

The  proof  of  Theorem  3.38  is  constructive  in  that  the  distribution 
of  fk  may  be  constructed  from 

R  . 

p(tt  >  t)  =  n  p(vr  >  t)  . 

k»l 

From  Theorem  3.38,  2.  so  we  can  use  in  place  of 

jr?  to  obtain  the  lower  bound  in  Theorem  3.36.  A  special  case  of 
Theorem  3.38  occurs  when  a  component  can  be  represented  by  a  Markov 
process  with  transition  rates  A  ,  0  <  j  <  i  <  M,  and  an  absorbing 
state  0.  This  is  important  since  exponential  failure  rates  are  often 
assumed,  especially  when  the  data  is  scarce. 
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4.  BLOCK  DIAGRAMS  AND  FAULT  TREF.S 

The  main  deterministic  tools  used  by  reliability  engineers  are  block 

diagrams  and  fault  trees.  It  is  assuned  that  the  probability 

distribution  for  the  state  of  each  component  at  a  certain  point  in  time 

is  known  and  that  the  problem  is  to  find  the  probability  distribution 

for  the  state  of  the  system.  Block  diagrams  and  fault  trees  are  used  to 

find  that  distribution.  There  are  many  computer  codes  with  block 

diagrams  or  fault  trees  as  input  and  system  reliability  as  output  in  the 

binary  case,  and  some  work  has  be  n  done  on  algorithms  in  the  ternary 

case.  In  this  chapter  it  is  shown  that  multistate  components  may  be 

analyzed  using  existing  binary  algorithns.  Define  new  binary  variables 

Xy  which  are  1  if  component  i  is  in  state  j  or  higher  and  0 

otherwise.  Block  diagrams  and  fault  trees  may  be  formulated  in  terms  of 

these  binary  variables  to  calculate  the  probability  that  the  system  is 

at  a  certain  level  or  higher.  From  these  calculations,  the  expected 

system  utility  can  be  determined.  Unfortunately,  the  binary  variables 

are  highly  correlated  since  X.  =1  implies  X  ,  =  1  for  k  <  i,  and 

ij  lk 

this  correlation  decreases  computational  efficiency.  This  chapter 
presents  the  new  formulation,  problems  associated  with  it,  and  a  special 
case  in  which  computational  efficiency  may  be  greatly  increased. 


Notation: 


Xij  (X^j  } 


'(X)  =  1 


U(X)  >  k} 


N. 


x,  =  I1  X 


j=l 

M 


ij 


<*>(X)  =  l  <t>K  CX) 
k=l 


*k(x)  h  *k<xll ,x12, 


,X1,N  ’X2l’"-,X2,N„ . Xn,N  ) 

I  2  n 
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Each  $  (X)  is  a  binary  structure  function  that  depends  only  on  binary 

lc  k  k 

components.  Note  that  E$  (X)  “  h  and  that  the  minimum  cuts  for  $ 

are  the  minimum  cuts  for  $  at  level  k. 

A  disadvantage  of  this  formulation  is  that  there  are  many  more  com¬ 
ponents,  (N.+1)»**(N  +1)  as  opposed  to  the  original  n.  Many  of  the 
1  n 

binary  variables  may  also  be  irrelevant.  If  $  is  series  at  level  k, 
then  the  only  relevant  components  for  4>  are  X^k,  ,  ....  X^.  We 
could,  in  theory,  eliminate  these  irrelevant  components  to  get  M 
binary  coherent  structure  functions  of  n  components  each,  but  this  may 
be  more  work  than  solving  the  original  problem.  Another  disadvantage  of 
this  formulation  is  that  independence  among  the  components  is  lost. 

Even  if  the  original  multistate  components  are  independent,  the  binary 
components  X  will  be  associated  in  general  since  X  =  1  implies 

1 J  J 

Xik  =  1  *  k  <  j .  The  binary  structure  functions  are  also  associated 
since  4>J(X)  =  1  implies  < X >  ■  1  v  k  <  j.  An  advantage  of  this 


formulation  is  that  results  which  require  only  that  <KX)  be  increasing 


in  X  follow  immediately  from  the  results  in  the  binary  case.  For 
example,  some  reliability  bounds  for  associated  components  follow  from 
the  corresponding  binary  results.  However,  the  major  advantage  of  this 
formulation  is  that  computer  codes  for  the  binary  case  can  be  applied  to 
the  multistate  case  with  adjustments  in  the  model  rather  than  the  code. 

A  block  diagram  is  a  system  schematic  composed  of  series  and 
parallel  (minimum  and  maximum)  operators.  Each  block  in  a  block  diagram 
represents  a  binary  random  variable.  The  output  of  n  blocks  in  series 
is  rain(X^)  «  while  the  output  of  n  blocks  in  parallel  is 

raax(X^)  ■ 
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Example  4.1:  Consider  the  block  diagram  in  Figure  4.1.  The  system  will 
function  if  component  1  functions,  component  2  functions,  and  one  of 
components  3,  4,  and  5  functions. 

MX)  =  *^2  max(X3,X4,X5)  . 

Computer  codes  generally  perform  these  calculations  in  stages,  e.g., 

5 


Figure  4.1  Block  Diagram 

If  the  components  in  a  block  diagram  are  independent,  the 
calculation  of  system  reliability  is  simple  -  just  replace  X^  by 
P^  =  P(X^  »  l)  and  be  sure  to  use  JL”_3  rather  than  maxCX^) .  In 

Example  4.1,  h(P)  *  ~i-3  ^i’  ^  the  coraPorients  are  dependent, 

then  conditional  probability  expansions  are  used,  e.g.  , 

h(P)  -  P2E( <p(X)  |Xj  =  1)  +  (1-P1)E(<KX)|X1  =  0)  . 

A  block  diagram  is  not  limited  to  systems  which  have  only  series  and 
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paralLel  combinations  of  components  since  a  system  can  be  represented  in 
terms  of  its  min  paths  or  min  cuts  (Theorem  2.3).  However,  if  a 


component  appears  in  more  than  one  min  cut  or  rain  path,  conditional 
probabilities  must  be  used  to  correctly  compute  system  reliability.  The 
output  of  a  block  diagram  is  usually  the  probability  of  system  success, 
but  the  output  can  be  the  probability  of  system  failure  if  the  dual 
structure  function  is  used. 

Example  4.2:  Consider  the  system  shown  in  Figure  4.2.  If  component  1 

functions,  then  the  system  functions.  If  not,  then  one  of  components  2 

and  3  must  function,  and  2-out-of-3  of  components  4,  5,  and  6  mist 

function  in  order  for  the  system  to  function.  Since  components  4,  5, 

and  f)  appear  in  more  than  one  place  in  the  block  diagram,  a  conditional 

probability  expansion  must  be  used  to  calculate  system  reliability. 

Conditioning  on  X  , 

4 


Figure  4.2  Success  Block  Diagram 
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The  dual  of  this  system  is  shown  in  Figure  4.3  with  the  boxes  now 


representing  component  failures  rather  than  component  success.  Note 
that  the  dual  of  a  2-out-of-3  system  is  a  2-out-of-3  system  which  can  be 
represented  in  the  two  different  ways  shown  in  Figure  4.2  and  Figure 
4.3.  0 


Figure  4.3  Failure  Block  Diagram 

To  perform  reliability  calculations  for  multistate  systems,  design 

1c 

block  diagrams  corresponding  to  <p  for  each  system  level  k,  and 
calculate  hk  =  E<t>k(X) .  Then  EU(4>(X))  =  b^^X)  is  the 

expected  sv  •  i  utility.  If  the  components  are  independent  and  if  at 
most  one  level  of  each  component  appears  at  most  once  in  the  block 
diagram,  then  the  reliability  calculation  may  be  performed  as  though  the 
binary  components  were  independent.  Otherwise  conditional  probability 
expansions  are  necessary. 

Example  4.3:  Consider  a  twin  engine  jet  which  can  land  normally  if  one 
engine  is  at  full  power  and  the  other  engine  is  at  half  power.  It  can 
land  on  a  foamed  runway  if  one  engine  is  at  full  power  or  if  both 
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engines  are  at  half  power.  It  will  crash  if  one  engine  is  at  half  power 


and  the  other  engine  is  inoperable.  Let  the  component  state  space  be 
{ 0  =  failed,  I  =  half  power,  2  *  full  power),  and  let  the  system  stat 

space  be  {0  =  crash,  1  =  land  on  foamed  runway,  2  =  land  normally). 

$(2,2)  =  $(2 ,  l )  =  $(1,2)  =  2 

$(0,2)  =  $(2,0)  =  $(1,1)  =  1 

$(0,1)  =  $(1,0)  =  $(0,0)  =  0 

$l(X)  =  max  (X12«X22’X11X21) 

$2(X)  =  max  (X11X22’  Xi2X2l^ 


e 


$  and  $*"  are  shown  in  Figure  4.4.  Note  that  both  the  component 
number  and  the  minimum  required  component  state  appear  inside  the  blocks 
in  the  block  diagram.  U 


$2: 


,r| - 


_  i,2 - ITU 


Figure  4.4  Multistate  Block  Diagrams 

There  are  some  cases  in  which  the  calculations  are  much  easier. 

Consider  a  MSF  which  is  k-out-of-n  at  each  system  level  m  where  k 

is  allowed  to  depend  on  m.  Then,  for  each  level  k,  $  (X) 

=*  $  (X,,  ,X„,  , . . .  ,X  .  ).  Component  states  other  than  k  are  unimportant 
lk  2k  nk 
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in  determining  the  value  of  $  .  This  means  that  if  the  original 
multistate  components  were  independent,  then  the  binary  variables  X^ 

It 

will  be  independent  in  each  0  .  No  conditional  probability  expansions 
will  be  necessary  since  the  xik's  are  independent  for  fixed  k,  and  for 

k 

every  i,  only  one  X  appears  in  the  block  diagram  for  4>  .  This 
situation  occurs  in  Example  4.4. 


Example  4.4:  Let  ^X^X  ,X  )  be  parallel  at  level  1,  2-out-of-3  at 
level  2,  and  series  at  level  3,  and  let  X^,  X^,  and  X^  be  independent. 


4>  (X)  =  l,v3 

Ui=1x 


l2  >2  > 


The  block  diagram  for  each  Q  is  shown  in  Figure  4.3.  Reliability 

1  3 

calculations  may  be  performed  using  independence  for  $  and  $  ,  but 
conditional  probabilities  are  necessary  to  correctly  calculate 
P(*2(X)  -  1).  D 


Figure  4.5  Multistate  Block  Diagrams 


u 

An  even  more  specialized  case  occurs  when  all  of  the  $  have 
identical  block  diagrams.  In  this  case  a  system  analysis  would  require 
only  a  single  block  diagram  with  different  input  probabilities  for 
different  system  levels.  This  would  occur  for  a  system  which  is  series, 
parallel,  or  k-out-of-n  at  all  levels.  It  also  occurs  in  Example  4.5. 

Example  4.5;  A  power  plant  can  generate  0% ,  25%,  50%,  75%,  or  10 OK 
(corresponding  to  states  0,  1,  2,  3,  and  4)  of  rated  electric  capacity 
depending  on  the  condition  of  the  turbine  and  the  amount  of  steam  flow 
reaching  the  turbine.  Three  turbines  are  available,  and  the  one  which 
can  maximize  power  output  is  always  used.  Components  1,  2,  and  3  are 
the  turbines  and  component  4  represents  the  rate  of  flow  at  the 
turbine.  The  block  diagram  for  all  levels  of  this  system  is  shown  in 
Figure  4.6.  □ 


Figure  4.6  Multilevel  Block  Diagram 

Fault  trees  and  event  trees  are  system  models  consisting  of  a  top 
event  and  a  structure  delineating  the  ways  in  which  the  top  event  can 
occur.  The  term  fault  tree  is  used  when  the  top  event  is  system  failure 
while  the  term  event  tree  is  used  for  system  success.  The  tree 
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structure  consists  of  "and  gates"  and  “or  gates"  which  respectively 
perform  the  same  function  in  the  fault  tree  as  the  series  and  parallel 
operators  in  the  block  diagram.  The  inputs  to  these  gates  are  generally 
component  successes  or  failures. 

Example  4.6:  The  fault  tree  and  event  tree  for  the  system  in  Example 
4.2  are  shown  in  Figure  4.7.  The  squares  with  numbers  inside  represent 
an  occurrence  of  component  failure  in  the  fault  tree  and  component 
success  in  the  event  tree.  The  dual  of  a  fault  tree  is  an  event  tree 
and  vice  versa.  It  is  very  easy  to  draw  the  dual  since  the  only  changes 
are  that  "and  gates"  become  "or  gates"  and  "or  gates"  become  "and 
gates".  □ 


Fault  Tree  Event  Tree 


□  - 
Q- 


"and  gate" 
"or  gate" 


Minimum  of  its  binary  inputs. 
Maximum  of  its  binary  inputs. 


Figure  4.7  Fault  Tree  and  Event  Tree 


-94- 


Computer  algor  It  tins  based  on  fault  trees  generally  procede  by 
finding  the  ain  cut  sets  of  a  system.  This  is  a  minimal  group  of 
eleaents  whose  failure  to  function  causes  the  system  to  fail 
to  function.  In  a  fault  tree  the  min  cut  sets  for  the  system  are  the 
ainiaal  sets  of  basic  events  which  cause  the  top  event  to  occur,  l.e. , 
the  min  path  sets  in  the  dual  structure.  Min  cut  sets  are  useful  to 
reliability  engineers  since  they  provide  a  qualitative  measure  of  the 
most  important  components  in  the  system.  The  relationship 

*(X)  -  n  XL  X 

j-1  ieKj 

is  then  used  to  determine  the  value  of  $.  If  the  components  are 
independent  and  each  component  appears  in  at  most  one  min  cut  set,  then 

E<KX)  -  n  11  P  . 

j-1  ieKj  1 

Conditional  probabilities  must  be  used  if  a  component  appears  in  more 
than  one  min  cut  set  just  as  in  block  diagrams.  Calculations  of  system 
reliability  using  fault  tree  computer  codes  generally  take  longer  than 
the  same  calculation  with  a  computer  code  based  on  block  diagrams,  but 
the  output  from  the  fault  tree  evaluation  includes  the  min  cut  sets 
while  the  block  diagram  output  does  not. 

The  extension  of  fault  tree  analysis  to  the  multistate  case  is  very 
similar  to  the  extension  of  block  diagrams.  A  fault  tree  or  event  tree 
is  constructed  for  each  system  level.  Basic  events  are  X^  for  event 
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trees  and  1-X^  for  fault  trees.  A  binary  fault  tree  code  is  used  to 
determine  E+  (X)  for  each  system  level  k,  and  expected  system  utility 
is  calculated  from  EU(tfX))  -  b^E^X). 

Example  4.7:  The  event  trees  for  the  structure  function  contained  in 
Example  4.4  are  shown  in  Figure  4.8.  □ 


Figure  4.8  Multistate  Event  Trees 

The  special  cases  pertaining  to  multistate  block  diagrams  have 
straightforward  analogies  to  fault  trees  and  event  trees.  In  particular 

It 

if  the  4  have  identical  block  diagrams,  the  fault  trees  or  event 
trees  for  each  4  are  also  identical.  This  is  very  useful 


computationally  since  the  min  cut  sets  need  to  be  found  only  once 


Example  4.8;  The  fault  tree  for  the  system  contained  In  Example  4. 
shown  In  Figure  4.9.  The  min  cut  sets  for  $  are  {(4,k)}  and 
{(1 ,k) ,(2,k) ,(3,k) }.  Thus, 

♦kOO  ■  x4k  A  xlk  . 

Assuming  independent  components, 

3 

e.  oo  -  p4k  -U  plk  .  o 


Figure  4.9  Multilevel  Fault  Tree 


5  is 
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5.  THE  CONTINUOUS  MODEL 


In  this  chapter  the  state  of  each  component  and  the  system  is  al¬ 
lowed  to  be  any  real  number  in  the  unit  interval.  The  restriction  to 
the  unit  Interval  is  used  only  for  discussion  purposes,  and  the  results 
are  valid  for  any  finite  segment  (possibly  different  segments  for  each 
component)  of  the  real  line.  The  multistate  model  can  be  considered  as 
a  special  case  of  the  continuous  model  by  restricting  a  component  with 
N  +  1  states  to  values  (0,  1/N,  ...,  (N-i)/N,  l)  in  the  unit 
interval.  This  approach  has  not  been  taken  since  it  is  felt  that  the 
multistate  model  will  be  of  more  practical  importance  than  the 
continuous  model.  Since  any  process  requiring  measurement  has  a  number 
of  states  limited  by  the  precision  of  the  measurement  device,  there  are 
no  truly  continuous  processes  in  reliability  engineering.  For  example, 
temperature  can  be  measured  only  to  the  nearest  degree  or  fraction  of  a 
degree.  In  current  practice  a  continuous  process  is  usually  broken  up 
into  a  finite  number  of  qualitative  states,  e.g. ,  low,  normal,  and  high 
pressure. 

All  of  the  results  previously  presented  for  the  multistate  case 
have  analogues  in  the  continuous  case.  Coherence  means  that  the  struc¬ 
ture  function  is  increasing  in  the  states  of  its  components,  that  an  in¬ 
finitesimal  change  in  any  component  state  may  cause  an  infinitesimal 
change  in  the  system  state,  and  that  an  increasing  utility  function  is 
associated  with  the  states  of  the  system.  Some  concepts  such  as  cut 
sets  do  not  seem  as  relevant  since  there  could  be  an  uncountably 
infinite  number  of  them,  but  the  results  presented  in  Chapters  2  and  3 
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are  still  valid.  Block  diagrams  and  fault  trees  can  still  be  used  to 
determine  the  probability  that  the  process  exceeds  a  certain  level. 
Unfortunately,  however,  they  cannot  be  used  to  determine  expected  system 
utility  unless  the  utility  function  is  piecewise  constant  since  that 
would  involve  an  infinite  number  of  calculations. 

5.1.  Definitions 

The  definitions  in  this  chapter  are  straightforward  extensions  of 
the  definitions  in  Chapter  2.  Let  X^etO.l]  be  the  state  of  the  ith 
component  ([0,1]  is  the  unit  interval),  and  let  $  :  [0,1 Jn  *  [0,1]  be 
the  state  of  the  system. 

Definition :  4>(X)  is  a  Continuous  Structure  Function  (CSF)  if 

(1)  $(0)  -  0,  <KJ.)  -  1,  and 

(2)  <j>(x)  is  increasing  in  X. 

Definition:  Component  i  is  said  to  be 

(A)  Relevant  if  there  exists  X  such  that  >  HO^.X), 

(B)  Fully  Relevant  if  v  0  £  Y  <  Z  £  1,  there  exists  X  such 
that  $(Z  X)  >  <KY1,X). 

Definition:  Let  $<X)  be  a  CSF,  and  let  U((K_X))  be  the 
corresponding  utility  function  which  assigns  utility  a(Y)  to  state  Y. 
Let  a(0)  -  0. 

$00  is  called 

(A)  weakly  coherent  if 

(1)  every  component  is  relevant,  and 

(2)  a(Z)  >  a(Y)  v  0  <  Y  <  Z  <  1. 
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(B)  coherent  if 


(1)  every  component  is  fully  relevant,  and 

(2)  a(Z)  >  a(Y)  *0<Y<Z<1. 

(C)  strongly  coherent  if 

(1)  v  component  i  and  state  y,  there  exists  X  such  that 
♦( Y i »JC )  -  y  but  4>(Zi,X)  i  y  *  Z  +  y,  and 

(2)  a(Z)  >  a(Y)  *0<Y<Z<  1, 

Example  5.1 :  Let  a(X)  ■  X.  Then  $(X)  is  coherent  if  it  is  any  con¬ 
tinuous  monotonic  mapping  from  [0,l]n  onto  [0,1].  For  example. 


♦(xt,x2) 


exp(X2  X2i/2)  -  1 
e  -  1 


If  Xt  >  Yl,  then  tfXpl)  >  4>(Yrl).  If  X2  >  Y2>  then  ♦(I.X^  > 
4>(1,Y2).  Thus,  both  components  are  fully  relevant,  and  the  system  is 
coherent. □ 


To  show  that  a  CSF  is  coherent  or  strongly  coherent  could  be  very 
difficult  since  the  entire  unit  interval  must  be  considered.  A.  situa¬ 
tion  such  as  the  one  in  the  previous  example  must  exist. 

Definition;  A  system  represented  by  CSF  $  is  called 

(1)  Series  if  <fr(X)  -  min  (X,). 

i  1 

(2)  Parallel  if  <KX)  -  max  (Xj. 

“  i  1 

(3)  k-out-of-n  if  $(X)  -  max  {y  ;  >.  *)• 

The  extensions  to  series  (parallel,  k-out-of-n)  at  level  y  are  similar. 
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Theorem  5.1:  Let  $  be  a  CSF. 

(i)  $(X  V  Y)  )>  4>(X)  v  4>(¥)  and  if  ^  ts  Para^lel»  equality  holds, 

(ii)  4>(X  a  V)  <  <Ji(X)  a  $(Y)  and  if  4>  is  series,  equality  holds. 

Notice  that  Theorem  5.1  is  slightly  different  than  Theorem  2.2,  the 
multistate  version.  It  is  no  longer  true  that  $(X  v  Y)  **  $(X)  v  $(Y) 
implies  that  4)  is  a  parallel  structure  as  shown  by  Example  5.2. 

Example  5.2:  Let  $(Xj,X2)  =  max  (XpX2). 

4>(X  v  Y)  =  max  (X^,X2,V^,Y9)  «  max  (X^  v  k2>  Y*  v  Y2> 

=  4>(X)  v  <KY)  . 

However,  4>  is  not  a  parallel  structure  function.  2 

Definition:  Let  $  be  a  CSF.  X  is  a  path  vector  at  level  z  if 

<t>(X)  >.  z.  ft  is  a  path  vector  at  maximal  level  z  if  4>(X)  =*  z.  If,  in 

addition,  <t>00  <  z  whenever  X  <  X,  X  is  called  a  minimum  path  vector 

at  maximum  level  z.  Let  z  =  (z z  )  be  a  vector  such  that  if 

-  —  1  n 

X  >  £,  then  <t>( X)  ■>  z  and  if  Y  <  zL,  then  4>(Y)  <  z.  The  vector  £ 
will  be  called  a  min  path.  The  jth  min  path  at  system  level  z  will  be 
denoted  z^  ■  (z|,...,z^).  Mso, 

Pj(X)  =  L{X  >  =  l{Xt  >  z{>  ls  the  jth  minimum  path 

s true ture  function  at  level _ z. 

If  min  cuts  are  defined  as  they  were  for  MSFs,  they  do  not  exist  in 
general  since  there  is  no  X  such  that  $()C)  <  z  and  <KY)  >  z  when¬ 
ever  Y  >  X  if  4>  is  continuous  in  one  of  its  components.  In  the 
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following  definition,  inequalities  which  occur  in  the  multistate 
definition  are  changed  to  equalities  so  that  rain  cuts  will  always  exist. 

Definition :  Let  $  be  a  CSF.  JC  is  a  cut  vector  at  level  z  if  4>(X)  <  z. 

It  is  a  cut  vector  at  minimal  level  z  if  <KX)  =  z.  If ,  in  addition, 

<>(Y)  >  z  whenever  Y  >  X,  X  is  a  minimum  cut  vector  at  minimal  level  z. 

Let  z  *  (2 . 2  )  be  a  vector  such  that  if  X  <  z,  4>(X)  <  2 

—  1  n  -  —  — 

and  if  Y  >  z_,  <KY)  >  2.  The  vector  £  is  called  a  min  cut .  The 

jth  min  cut  at  system  level  z  will  be  denoted  z^  =  (z\ , . . .  ,z^  ) . 

—  in 

is  the  jth  minimum  cut  structure  function  at  level  £. 

Example  5.3:  Let  (>(X^,X0)  *  .SCX^+X^).  Then,  (X  ,1-X^)  it  -  .  a 

min  cut  and  a  min  path  at  system  level  1/2  -v  0  <  X  <  1.  Note  that 
there  are  an  uncountably  infinite  number  of  min  cuts  and  min  paths  at  an 
uncountably  infinite  number  of  system  levels.  There  is  no  vector  z_ 
such  that  4>(_z)  <  1/2  and  <)>(_Y )  >  1/2  -v  Y  >  z^.  J 

The  change  in  the  definition  of  min  cuts,  although  necessary  for 

their  existence  as  shown  by  Example  5.3,  may  create  difficulties  with 

respect  to  bounding  system  reliability.  Bounds  based  on  min  paths  will 

be  bounds  on  h  =  P(  $(JC)  ^  z)  while  bounds  based  on  min  cuts  will  be 

bounds  on  F.  =  P($(X)  >  z).  However,  if  F.  is  continuous  in  a 
9  —  9 

neighborhood  of  z,  then  hz  «  F,,  and  this  problem  will  not  exist. 

9 

Also,  since  P($(X)  >  z)  <  P(  $(  ,X)  z),  lower  bounds  based  on  cuts  still 

apply. 
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Theorem  5 >2:  Let  $  be  a  CSF. 


s 

(i)  4>(X)  >  z  <*=»  ii  p,(X)  -  1  . 

-  j-1  1  ~ 

t 

(il>  <KX)  >  z  <=4  J1  <.(X)  =  1  . 

If  the  X^  are  Independent,  then  P(p^(X)  ■  1)  »  P(X^  z^) 

and  P( K. ( X )  -  0)  =  n£=1  p(Xt  £  zj) . 

Definition :  Let  $  be  a  CSF.  Its  dual,  <t>^,  is  defined  by 

$D(x)  =  i  -  4>a-x). 

Theorems  2.4  and  2.5  which  state  that  the  dual  of  a  MSF  is  a  MSF 
with  the  same  type  of  coherence  and  that  (<J>^)D  =  $  are  identical  when 
CSF  replaces  MSF.  Theorem  5.3  is  the  continuous  analogue  of  Theorem 

2.6. 


Theorem  5.3:  Let  X  be  a  path  (cut)  vector  at  level  z  for  a  CSF  $. 

Then  _1  -  X  is  a  cut  (path)  vector  for  <t>°  at  level  I  -  z.  Further¬ 
more,  if  z_  is  a  min  path  (cut)  at  maximum  (minimum)  level  z  for  <}> , 

then  I  -  z  is  a  min  cut  (path)  at  minimum  (maximum)  level  1  -  z  for 


Proof :  If  X  is  a  path  vector  at  level  z  for  4>,  4>(X)  >  z.  Thus, 

-  X)  =  1  -  $(X)  <  I  -  z.  If  £  is  a  min  path  for  <t>,  $(0  *  z 
and  v  Y  <  $(Y)  <  z.  Thus, 
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$^(J_  -  z)  *  1  -  <t>( z^)  *1-2  and  *  Y  <  z , 

4>D< j_  -  Y)  >  1  -  z . 

The  results  In  parentheses  hold  by  replacing  $  with  4°  and  X  with 

i  -  x.O 

Let  (C,$)  denote  a  set  of  components  C  and  a  CSF  4>. 

Definition :  (A,x)  is  called  a  module  of  (C,$)  if  A  _c  C  and 

A  AC 

$(X)  »  4»(x(X  )  ,X  )  where  is  a  CSF.  A  modular  decomposition 
is  a  set  of  modules  {( Xj >A^ ),...( Xr*Ar) }  such  that 
♦(X)  =  4>(  X^(XA^ )»•••.  Xj.(XAr) )  where  the  sets  A^  partition  C 
and  4 >  is  the  organizing  structure. 

Theorem  2.8  -  which  shows  that  if  the  modules  and  the  organizing 
structure  possess  certain  types  of  coherence,  then  so  does  the  original 
MSF  -  is  still  valid  with  CSF  replacing  MSF.  In  particular.  Example  5.4 
shows  that  if  x  and  4>  are  both  weak  coherent,  <j>  is  not  necessarily 
weak  coherent.  The  technique  used  in  this  example  transforms  a  multi¬ 
state  system  into  a  continuous  system.  This  technique  can  be  used  to 
extend  all  multistate  examples  and  counter-examples  to  the  continuous 
case. 

Example  5.4:  The  counter-example  from  Theorem  2.8  (111)  is  extended  to 
the  continous  case.  That  example  contained  ternary  components  and  a 
ternary  system. 
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Let  state  0  +  states  [0,1/3],  1  ♦  (1/3, 2/3],  2  ♦  (2/3,1]. 
X(XL,X2)  -  (KXt  ,X2>  -  0  for  0  <  X  <  2/3,  0  <  X£  <  1/3 

x(XrX2)  -  <KX1(X2)  =  1/2  for  0  <  <  2/3,  1/3  <  X£  <  1 

X(Xt,X2)  -  <KXj  ,X2)  -  1  for  2/3  <  Xj  <  1,  0  <  X2  <  1  . 

Let  ♦(Xl,X2,X3)  -  «KX(X1,X2),X3). 

Then  4>(X^  ,02  ,X^)  =  4>(X  ,  1 2  .X^)  so  component  2  is  not  relevant,  and 
4>  is  not  weakly  coherent.  □ 


Theorem  3.4:  The  dual  of  a  module  is  a  module  in  the  dual,  i.e.  , 
Ax)  =  AxD(XA).XA  ). 


Proof  : 


<>d(i-x(1a-xa),xaC) 

c  c 

=  1-<K  xOA-xA)  ,j.A  -xA  ) 

«  i-<t»U-x)  -  Ax)  .  U 


5.2.  Extension  of  Stochastic  Results 

Throughout  this  section,  it  wilL  be  assumed  that  F^(y)  = 

P(<KX)  >  y)  is  continuous  in  y.  This  implies  that  P(  4>(X)  ^  y)  = 

P( $(X)  >  y),  and  the  problem  associated  with  the  definition  of  min  cuts 
does  not  exist.  This  could  be  generalized  to  handle  random  variables 
which  have  both  discrete  and  continuous  parts,  but  it  does  not  seem 
worthwhile  in  a  reliability  context. 


In  the  continuous  case  a  utility  function  has  a  continuous  domain, 
{a(X),0<^X<  l  },  rather  than  a  discrete  domain,  {a^  ,  j-1 , . . .  ,M).  To 
get  a  continuous  analogue  of  bj  ■  a^  -  aj_j>  8et  a(0)  *  0,  assume 
a(X)  is  continuously  differentiable,  and  integrate  by  parts. 
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EU(  $(X)  )  -  /q  a( Y)dF^(Y) 

-  «(^VY)|0  '  ^  •*<Y)%(V)dY 

»  a(  1 )  -  /q  a'  (Y)dY  +  a*  (Y)  [l-F^(Y)  ]dY 
*  a( 1 )  -  a( 1)  +  Jq  a' (Y)F^(Y)dY 

-  /q  a'(Y)P($(X)  >  Y)dY  . 

Thus,  a'(Y)  is  the  desired  analogue  to  b^  .  Proper  definition 
of  a'(Y)  at  discontinuities  of  a(Y)  would  allow  this  situation  to  be 
slightly  generalized,  but  for  applications  it  seems  easier  to  "smooth" 
a(Y)  than  to  be  mathematically  precise. 

Theorem  5.5: 

EUUOO)  =  !q  EU<$<Y.,X)dF.(Y)  . 

Proof : 

EUU(X))  -  Jq  E(U($(X))|X.«Y]dF.(Y) 

-  Jq  E[U(WYi,X))]dFi(Y)  .  □ 

Theorem  5.6:  Let  4>  be  a  coherent  CSF. 

(i)  EU(<XX  v  Y))  >  [EU(«KX))1  v  IEU(«KY))1  and  if  ^  is  a 

parallel  CSF,  equality  holds. 

(ii)  EU(<KX  AY))  <  [  EUC  4>(  X) )  ]  a  [EU(4>(Y))j  and  if  *  is  a 

series  CSF,  equality  holds. 


Proof: 


(i)  P(*(X  v  Y)  >  z)  -  P($(X)  >  z)  v  P(<KY)  >  z) 


-  J  I  u 

X  Y 


{♦(x  v^)2z  > 


1  {$(x)  v$(£)>z } 


JdF^xJdF^jO 


>  0 
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where  equality  holds  If  0  is  a  parallel  CSF  from  Theorem  5.1.  Multi¬ 
plying  by  a'(Y)  >  0  and  integrating  from  0  to  l  finishes  the  proof. 
Part  (ii)  is  similar.  □ 

Theorem  5.7:  Let  X( Y)  have  distribution  ).  If  3.  3.'  and  0 

is  a  weakly  coherent  CSF,  then  EU($(30)  <  EU(0(Y^). 

Proof:  Since  0(X)  is  increasing  in  X*  P(  0(X)  2  z)  .£  000  2  z)  v  z* 

Multiplying  by  a'(z)  >  0  and  integrating  yields  the  result.  D 

Definition:  The  X,Y  reliability  importance  of  component  i  at  level 

z  is 

I*’Y  <i)  =  P(0(X.,X)  >  z)  -  P(0(Y.  ,X)  >  z)  for  0  <  Y  <  X  <  1. 
a  l  —  —  i  —  —  —  ii  — 

However,  this  measure  of  reliability  importance  is  not  as  appealing 

as  the  mulristate  r,s  reliability  importance  since  there  are  an 

infinite  number  of  X,Y,  and  z  to  choose  from.  Also,  it  is  not  so  easy 

to  change  the  distribution  of  some  X.^  and  derive  a  nice  expression  for 

X  Y 

the  change  in  reliability  in  terms  of  1^’  (i).  There  is  a  stronger 

objection  to  extending  the  r,s  criticality  importance  to  the 

continuous  case.  The  denominator  of  this  measure  is  (N  +1)  •••  (N  +1) 

l  n 

representing  the  total  number  of  component  states.  This  denominator 
will  be  infinite  in  the  continuous  case.  There  are  also  problems  with 
most  of  the  other  importance  measures  since  they  contain  expressions 
such  as  h^C  (k+l)^,£)  -  hm(k^,j?).  It  might  be  possible  to  consider 
derivatives  and  then  integrate  over  the  unit  interval,  but  this  does  not 
seem  very  useful.  However,  the  multistate  extension  of  reliability 
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importance  due  to  Griffith  [1980]  seems  to  generalize  nicely  to  the 
continuous  case. 


Theorem  5.8: 


1  L 

EU(<KX)>  -  /  a'  (y)P(#(0  •  ,X)  >  y )dy  +  /  I(i)F  (z)dz 
0  0 


where 


l(i)  -  /  a' (y)  [^P($(zi,X)  >  y)dy  . 


Proof: 


1 

P(*(X)  >  y )  -  /  P($(z  X)  >  y)dF  (z) 

0  1  1 

l  1  a 

*  PUU^X)  >  y)Fi<z)  Izai0  -  /  [^P(^)(zi,X)  >  y);?l(z)dz 

l  A 

=  P<  <t>(  1  .  ,X)  >  y)  -  /  [-P($(z.,X)  >  yjdz 

i  0  dz  t  -  ~ 

l  , 

+  /  It;P(0(z.  ,X)  >  y](l-F.(z))dz 

0  az  l  i 

l  , 

-  P(^(0t,X)  >  y)  +  /  l£P(W*1.X)  >  y)]Fi(z)dz  . 

Multiplying  both  sides  of  the  above  equation  by  a'(y)  and  integrating 
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1 

EUUOp)  -  /  a*(y)PU(0  ,X)  >  y)dy 

0 

1  1  , 

+  /  a'(y)  /  [r-P($(z ,  ,X)  >  y)]F  (z)dzdy  . 

0  0 

The  result  follows  by  interchanging  the  order  of  integration  in  the 
second  term.  0 

Corollary  5.9:  If  component  i  is  stochastically  improved  to  a  new 
distribution  F*  with  F*(z)  F^z)  v  z,  the  change  in 
expected  utility  is 

1 

AEU( $(  X) )  -  /  I(i)[F*(z)  -  F  (z)]dz  . 

0 

The  definitions  of  IFRA  ( IFR,  etc.)  stochastic  processes  apply  to 
the  continuous  case  as  written.  In  fact  the  IFRA  and  NBU  closure 
theorems  (closure  under  formation  of  coherent  systems)  from  Section 
3.4.3  may  be  found  in  the  literature  for  the  continuous  case  rather  than 
the  multistate  case.  The  other  closure  theorems  and  proofs  apply 
immediately  to  the  continuous  case. 

The  bounds  in  Section  3.4.1  are  upper  and  lower  bounds  on 
P( ♦( X)  k).  These  bounds  apply  immediately  to  the  continuous  case  with 
k  restricted  to  the  unit  interval.  In  some  of  the  bounds  there  are 


products  of  rain  paths  or  min  cuts.  Since  this  number  may  now  be 
uncountably  infinite,  these  bounds  may  be  difficult  to  compute.  If  this 
occurs,  other  bounds  can  be  used  or  approximations  can  be  made.  The 


Bonferroni  bounds  contain  suns  of  a  number  of  min  paths  or  min  cuts. 
Depending  on  the  problem,  it  may  be  possible  to  transform  these  sums 
into  integrals  or  to  approximate  the  CSF.  Example  5.6  shows  how  a  CSF 
might  be  approximated  by  a  MSF. 

Example  5.6:  Consider  ■  .5(X^  +  X^) .  There  are  several  ways 

to  approximate  this  CSF  by  a  MSF.  As  an  example, 

X’  -  [ioxJ  X*  -  [iox2] 

♦•(X1,X2)  -  [ .5(X’  +  X^)] 

where  [x]  is  the  largest  integer  less  than  or  equal  to  X.  Multistate 
techniques  can  then  be  applied  to  $.  On  the  other  hand,  if  X^  and  X2 
are  associated  random  variables  with  densities  f  ^  and  f2>  a  direct 
calculation  is  possible. 

P(«X)  >  j)  -  P(X1  +  x2  >  1) 
i  I 

■  /  /  f.7(x,y)dxdy 

0  1-y 

1  1 

>/  I  f . (x)f  (y)dxdy 

0  1-y  1 

1 

■  /  f ,(y)F?(l-y)dy  .  0 

0 

Bounds  on  expected  system  utility  can  be  obtained  easily  from  the 
bounds  on  h2  -  P($(X)  >  z).  Let 

L(z)  <  PU(X)  >  z)  <  V(z)  . 
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Multiplying  by  a' (z)  0  (coherent  systems)  and  integrating  yields 

1  1 
J  a'  (z)L(z)dz  <  EU(<KX))  <  /  a*(z)V(z)dz  . 

0  0 

This  equation  is  the  analogue  of  Theorem  3.28.  To  get  an  analogue  of 
Theorem  3.29,  let  Z+  -  (z  :  a'(z)  >  0>  and  let  z”  -  {z  :  a'(z)  <  0}. 
Then 

!  a' (z)L(z )dz  +  /  a'(z)V(z)dz  <  EU(9(X)  <  /  a’(z)L(z)dz 

z  z"  ~  ~  z~ 

+  /  a* (z)V(z)dz 
Z 

Analogous  results  for  the  remainder  of  the  results  in  Section  3.4  follow 
immediately  by  restricting  the  component  and  system  states  to  the  unit 
interval. 
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PART  II 


OPTIMAL  MAINTENANCE  OF  MULTI STATE  COMPONENTS 

The  theory  presented  in  the  first  part  of  this  thesis  is  mainly  de¬ 
terministic,  meaning  that  time  is  either  fixed  or  is  not  considered  as  a 
parameter  in  the  reliability  calculations  and  that  no  optimization  is 
performed.  This  part  of  the  thesis  is  stochastic  in  that  it  deals  with 
selecting  the  best  strategy  for  the  maintenance  of  a  system  over  an  in¬ 
finite  time  horizon.  Determining  the  optimal  maintenance  policy  for  a 
system  with  failure  and  repair  characterisitcs  that  vary  in  time  could 
be  a  difficult  task.  Thus,  the  usual  assumptions  are  that  system  opera¬ 
tion  is  a  Markov  or  semi-Markov  process  and  that  regeneration  points 
exist.  These  assumptions  allow  Markovian  decision  processes  to  be 
utilized. 
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6. 


INTRODUCTION  AND  HISTORY 


Most  of  the  literature  on  optimal  maintenance  of  multistate  systems 
pertains  to  inspection  models  in  which  the  state  of  the  component  is 
discovered  at  the  end  of  each  time  period,  and  a  decision  must  be  made 
to  replace  the  component  or  leave  it  in  operation.  The  objective  is  to 
minimize  costs  which  consist  of  a  constant  replacement  fee  and  a  penalty 
cost  if  the  component  enters  its  worst  state.  Since  the  transition  laws 
are  Markovian,  the  system  is  analyzed  using  discrete  time  Markov  deci¬ 
sion  processes.  The  main  results  are  control  limit  rules  which  call  for 
replacement  of  the  component  when  it  reaches  or  drops  below  a  certain 
state  (called  the  control  limit).  This  model  and  several  variations  of 
it  are  discussed  in  Section  6.2  after  an  introduction  to  Markov  decision 
processes  in  Section  6.1.  These  models  regard  state  0  as  the  best 
state,  and  states  1,2,  ....  M  as  increasingly  degraded.  The  opposite 
convention  is  used  herein  so  that  the  notation  in  Parts  I  and  II  of  this 
thesis  is  consistent. 

The  models  in  this  thesis  differ  from  previous  models  in  that  the 
components  are  assumed  to  be  constantly  monitored  rather  than  periodi¬ 
cally  inspected.  Thus,  the  decision  to  retain  or  replace  a  component 
may  be  made  at  any  time  instead  of  immediately  following  an  inspection. 
In  Chapter  7  the  repair  or  replacement  process  occurs  with  the  system 
inoperable.  It  is  shown  that  an  equivalence  exists  between  the  contin¬ 
uous  time  model  and  the  discrete  time  models  previously  considered  in 

the  literature.  Thus,  the  major  results  are  control  limit  rules.  Shock 

models  are  those  models  in  which  damage  to  a  component  accumulates  via  a 
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Poisson  process,  and  a  decision  is  made  after  a  shock  occurs  to  retain 
or  replace  the  component.  In  Section  7.4  it  is  shown  that  an 
equivalence  exists  between  shock  models  and  the  continuous  time  model 
presented  herein.  Thus,  control  limit  rules  which  apply  to  shock  models 
also  apply  to  the  continuous  time  model.  In  Chapter  8  it  is  assumed 
that  the  component  or  system  continues  to  operate  while  the  repair 
process  is  taking  place.  The  problem  is  to  decide  from  which  states  ami 
to  which  states  the  process  should  be  repaired.  Theorems,  including 
control  limit  rules,  are  presented  to  help  resolve  that  issue.  Choosing 
between  repair  and  replacement  of  a  component  with  the  system  inoperable 
is  the  subject  of  Chapter  9.  The  component  may  be  repaired,  replaced, 
or  left  alone,  and  it  is  shown  that  a  type  of  control  limit  rule  is 
optimal. 


6.1.  Markov  Decision  Processes 

The  main  ingredients  of  discrete  dynamic  programming  are  a  state 
space  S,  a  set  of  actions  A,  rewards  r^(a),  and  transition  probabili¬ 
ties  p^(a).  The  process  begins  a  period  in  a  state  i  e  S,  and  an 
action  a  c  A  is  chosen.  A  reward  r^(a)  is  received,  and  the  process 
jumps  to  state  j  to  start  the  next  period  with  probability  p^(a) 
(where  j  -  i  is  allowed).  This  series  of  events  is  repeated  indefi¬ 
nitely.  The  process  is  a  discrete  time  Markov  decision  process  (DTMDP) 


if  the  rewards  and  transition  probabilities  depend  only  on  the  current 

state  and  chosen  action.  Thus,  p^(a)  are  Markov,  i.e.  ,  p^(a) 

■  P(X  ■  j  I  X  -  i,  action  a  is  chosen)  is  independent  of  n  and 
n+1  n 

the  past  history  of  the  process.  It  is  assumed  throughout  this  thesis 
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Chat  S  and  A  are  finite.  Rewards  are  also  finite,  and  they  will 
generally  be  considered  costs  rather  than  rewards  since  the  optimization 
criterion  used  herein  is  minimizing  cost  rather  than  maximizing  profit. 

A  policy  R  is  a  rule  for  selecting  a  decision  in  each  state  at 
each  point  in  time.  Let  C  be  the  set  of  all  policies.  By  the  Mar¬ 
kovian  assumption,  it  is  sufficient  to  consider  the  subclass  of  policies 
that  depend  only  on  the  current  state  and  period.  For  an  infinite  hori¬ 
zon  problem,  the  policies  will  depend  only  on  the  current  state,  i.e., 
they  will  be  stationary.  It  can  also  be  shown  (see  Derraan  [1962])  that 
it  is  sufficient  to  consider  the  class  of  non-randotnized  policies, 
i.e. ,  policies  which  assign  only  one  action  to  each  state  in  each 

period.  Let  {X  ,n*0,l,...}  be  the  sequence  of  observed  states,  let 
n 

{A  ,n»0,l,...}  be  the  sequence  of  observed  actions,  and  let 
n 

{VI  ,n**0,l,...}  be  a  sequence  of  random  variables  such  that  W  =  r  (a) 
n  n  i 

if  X  *  i  and  A  -  a.  The  problem  is  to  minimize  discounted  or  un- 
n  n 

discounted  costs  over  an  infinite  time  horizon.  When  discounted  costs 
are  considered,  a  discount  factor  a  will  be  used.  Expected  costs  in  a 
single  period  when  policy  R  is  used  are 

Vo  ’  X  l.  P*<Xn  ■  *n  ‘  • 


The  undiscounted  costs  for  a  finite  horizon  N  using  policy  R  and 
starting  in  state  i  is 


Vl> 


N 

e  c  y  w  ) 

R  n  n 

n=0 


N 

l 


l  l 


n*0  jcS  aeA 


WJ  Va 


vi)ya> 
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Continuous  time  Markov  decision  processes  (CTMDP)  are  similar  to 
DTMDP.  Upon  arriving  in  state  i  e  S,  an  action  a  e  A  is  chosen,  and 
a  reward  r^(a)  is  earned.  The  holding  time  in  state  i  is  exponen¬ 
tially  distributed  with  parameter  A^Ca),  and  the  process  then  jumps  to 
state  J  with  probability  p^(a)  (p^  >  0  is  permitted).  The  re¬ 

wards,  holding  times,  and  transition  probabilities  depend  only  on  the 
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current  state  and  chosen  action.  S  and  A  are  assumed  to  be  finite, 


and  the  objective  is  to  minimize  cost.  In  the  discounted  case,  a 
discount  rate  P  ( os«e  P)  will  be  used.  Let  {Y(t),t>0}  be  the  state 

of  the  process  at  time  t,  let  Y(t)  -  Y  if  T  <  t  <  T  where  T 
r  n  n  —  —  n+1  n 

is  the  time  of  the  nth  jump,  let  A  be  the  action  chosen  in  state  Y  , 

n  n 

and  let  N  =  max  (n:T  <t).  The  total  expected  discounted  cost  using 
t  n— 

(stationary)  policy  R  with  initial  state  i  is 


WR(i.B) 


on 


(A  ) 


i) 


The  expected  average  cost  is 


VR(l) 


=  lira  eh 

t-»<t>  R 


(T  l  rYn(An) 

c  n=0  Yn  n 


i) 


when  the  limit  exists. 

The  Markov  decision  processes  in  this  thesis  are  usually  derived 

from  discrete  time  Markov  chains  (DTMC)  and  continuous  time  Markov 

chains  (CTMC).  Some  results  and  notation  from  the  theory  of  Markov 

chains  will  be  useful.  The  Markov  chains  in  this  thesis  will  be 

irreducible  and  positive  recurrent  on  a  finite  state  space. 

Let  (X  ,0=0,  l,...}  be  a  DTMC. 
n 

Notation: 


P 

P 

P 


1J  S  P(Xn+l'J  I 


I V1  " 


•xo-*> 


-  (p^)  is  tlie  matrix  transition  probabilities 


‘“h  P(Xn-J 


V1’ 


V  n(n-0 
l  Pjr  pkj 
keS 
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n  =  lim  p..  are  called  the  steady  state  probabilities. 

J  n-*» 

nP  =  n,  n  >  0  -v  i,  and  u  =  1  . 

i  —  i 

keS 


Let  {X(t),t>0)  be  a  CTMC. 


Notation:  X..  is  the  transition  rate  from  state  i  to  state  j. 

- ij 

Q  =  (  X_  )  is  the  matrix  of  transition  rates  with  diagonal 


elements  X 


i 


eS 


ij 


‘t. 


eS 


ij 


is  the  parameter  of  exponential  holding 


t  ime  in  s  tate  1 

m(i)  =  1 / is  the  mean  holding  time  in  state  i 

p  =  X.  /X  is  the  transition  probability  from  state  i  to 
ij  lj  i* 

state  j 

.  is  the  mean  time  from  arrival  in  state  i  until  the  next 
ij 

arrival  is  state  j,  sometimes  called  a  first  passage  time 


it,  =  m(j)/|i.  .  are  the  steady  state  probabilities 
J  JJ 


6.2.  Derman's  Model  and  Extensions 

The  use  of  discrete  dynamic  programming  to  determine  optimal 
maintenance  policies  for  multistate  systems  was  pioneered  by  Derman  in 
the  1960's.  Derman  [1962]  showed  that  only  non-randomized  decision 
rules  need  be  considered,  and  in  Derman  [1963]  it  was  shown  that  control 
limit  rules  are  optimal  for  certain  types  of  systems.  The  discussion 
herein  follows  Derman  [1970]. 

Consider  a  component  or  system  which  is  inspected  at  equally  spaced 
points  in  time  and  classified  into  state  0,1,..., M  with  state  M 
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being  che  best  state.  At  the  beginning  of  each  period,  an  operator  may 
choose  between  two  actions  -  to  replace  (which  takes  one  period)  or  to 
retain  the  component.  The  component  must  be  replaced  when  it  reaches 
state  0.  A  control  limit  rule  is  a  policy  which  replaces  the  component 
when  the  observed  state  is  0,  1,  ...,  i*  and  retains  the  component 
when  the  observed  state  is  i*+l,...,M  where  state  i*  is  called  the 
control  limit.  Replacing  the  component  costs  C  >  0  units  of  money  and 
there  is  an  additional  penalty  cost  K  >  0  if  the  component  is  replaced 
from  state  0.  The  states  may  be  considered  as  the  remaining  capability 
of  the  component,  and  a  penalty  is  assessed  if  the  component  ever 


becomes  completely  inoperative.  Markovian  transition  probabilities 
are  used.  It  is  also  assumed  that  >  0  f°r  some  n  _>  1  so  that 


a  component  which  is  not  replaced  will  eventually  fail. 


In  the  notation  of  dynamic  programming,  the  problem  becomes 
S  =  { 0 , l , . . . M } ,  A  =  (0  *  do  nothing,  1  *  replace}, 


r  ^ (a)  = 

pu<*>  ’ 


a  =  0 

a  -  1,  i  >  1 
a  =  1 ,  i  ■  0 

a  =  0 

a  *  1 ,  j  »  M 
a  *  l,  j  j4  M 


The  standard  dynamic  programming  recursion  for  the  discounted  cost  case 
is 


M 

4>(i,a,N+l)  =  min  {a  'l  p  ,a,N) ;  C  +  aiHM,a,N)}  for  i  ^  0 

j*0 

=  C  +  K  +  a<|>(M,a,N)  for  i  *  0  . 
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This  is  slightly  different  from  Derman's  construction  since  he  assimes 

that  replacement  is  instantaneous.  In  that  case  ^(M.a.N)  is  replaced 
cM 

by  ijaO  ^Mj  •  a»^  a^ove  equation.  This  might  also  be  use¬ 

ful  if  the  new  component  could  be  in  any  state  due  to  damage  in  fabrica¬ 
tion  or  shipping. 

The  main  result  in  Dennan  [ 1970]  is  that  if  K  n  p.,  is  noain- 

-j=0 

creasing  in  i  V  k,  then  a  control  limit  rule  is  optimal  for  both  the 
expected  average  cost  criterion  and  the  total  expected  discounted  cost 
criterion.  This  means  that  there  exist  control  limit  policies  R*  and 
R**  such  that 


R*  =  arg  min  [<|>  (i,a))  and  R**  =  arg  min  [A  (i)]. 

K  R  K  R 


The  policies  R*  and  R**  may  be  determined  by  solving  linear  pro¬ 
grams,  and  they  will  generally  be  identical  for  small  interest  rates. 

The  restriction  that  lj-Q  P ^  be  nonincreasing  in  i  V  k  seems 
very  reasonable  since  it  means  that  if  no  replacement  occurs  and  one 
component  begins  a  period  in  a  better  state  than  another  component,  the 
first  component  will,  on  the  average,  also  begin  the  next  period  in  a 
better  state. 

There  have  been  several  extensions  of  the  basic  model.  In  Kolesar 
[1966)  a  state  occupancy  cost  was  added  to  the  model  so  that  cost  A^ 
was  charged  each  time  the  component  was  observed  in  state  i.  Derman's 
construction  is  a  special  case  of  this  with  =  K  when  replacement  is 
the  chosen  action  in  state  0  and  A^  *  0  otherwise.  A  control  limit 
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rule  Is  shown  to  be  optimal  under  the  additional  assumption  that  is 

nonincreasing  in  i.  The  costs  A^  are  not  restricted  to  be  nonnega¬ 
tive,  and  -  -A  may  be  thought  of  as  a  utility  rather  than  a  cost. 

The  problem  is  then  to  maximize  utility,  and  a  control  limit  rule  is 
optimal  when  is  nondecreasing  in  i.  Thus,  the  analysis  applies  to 

coherent  systems. 

A  generalization  of  Kolesar's  model  is  contained  in  Ross  [1969]. 
This  model  permits  a  continuous  state  space  which  is  useful  in  inventory 
applications.  A  control  limit  rule  is  again  shown  to  be  optimal.  The 
model  also  applies  to  the  case  in  which  several  components  may  be  or¬ 
dered  at  once.  The  state  of  the  system  is  (n,x)  where  n  represents 
the  number  of  spare  components,  and  x  represents  the  state  of  the  com¬ 
ponent  currently  in  operation.  When  the  system  is  in  state  (0,0),  any 
number  of  components  may  be  ordered.  It  is  shown  that  a  control  limit 
rule  applied  to  the  component  currently  in  operation  Is  optimal. 

The  basic  model  has  been  extended  to  include  more  general  cost 
structures  and  state  transitions.  In  Kalymon  [1972],  the  replacement 
cost  is  random,  and  there  is  a  salvage  value  which  depends  on  the 
replaced  component's  state  and  on  the  replacement  cost.  The  total  cost 
of  replacement  is  then  C  +  r(C)  +  s(i)  where  C  is  a  random  variable 
and  the  salvage  value  is  -(r(C)  +  s(i)).  This  leads  to  the  following 
recursion. 

M 

<J>(i,a,N+l)  -  min  {A  +  a  l  P .  j  ,ct ,N)  ; 

J-0  J 

Ai  +  C  +  r(C)  +  s(i )  +  a<KM,a,N)}  . 
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The  following  monotonicity  assumptions  are  made. 

(1)  and  s(i)  are  nonincreasing  in  i. 

(2)  C  +  r(C)  is  nondecreasing  in  C. 

(3)  a  P1+1  jlA.-sCj)]  -  a  l”.0  p^-sCj)] 

>  s( i+1)  -  s(i)  v  i  <  H  -  2. 

The  third  condition  seems  rather  strange,  and  no  heuristic  reason  for 
its  necessity  is  provided  in  the  paper.  If  these  monotonicity  assump¬ 
tions  are  added  to  the  assumptions  in  the  basic  model,  a  control  limit 
rule  is  optimal  for  both  the  discounted  and  average  cost  cases. 

In  Kao  [1972J  the  transitions  are  semi-Markov  rather  than  Markov. 
The  holding  time  in  a  state  can  then  depend  on  the  current  state  and  the 

target  state,  but  only  transitions  to  smaller  states  are  allowed.  Al¬ 
though  the  possibilities  of  variable  replacement  cost  and  variable  re¬ 
placement  time  are  discussed,  they  are  held  constant.  It  is  also 
assumed  that  the  expected  cost  per  occupancy  in  state  i  is 
nonincreasing  in  i.  Under  these  conditions,  a  control  limit  rule  is 
optimal. 

In  all  the  aforementioned  models,  it  is  assumed  that  inspections 
occur  in  every  time  period  and  that  replacement  provides  a  new  unit. 
Adding  inspection  scheduling  and  repair/replacement  considerations  into 
the  decision  process  further  complicates  the  basic  model.  In  Klein 
[1962]  the  decision  space  is  enlarged  to  allow  repair  to  any  state  and 
scheduling  of  the  next  inspection  some  number  of  periods  later.  Costs 
of  repair,  replacement,  inspection,  and  penalty  costs  for  failure  are 
Included  in  the  model.  It  is  shown  that  the  problem  may  be  formulated 
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as  a  linear  program,  but  no  discussion  of  control  limit  policies  or  the 
form  of  the  optimal  solution  is  included.  Another  model  of  this  type  is 
due  to  Luss  (1976].  In  this  paper  it  is  assumed  that  a  control  limit 
rule  is  optimal,  and  an  algorithm  is  presented  to  find  the  optimal  con¬ 
trol  limit.  State  transitions  are  governed  by  a  continuous  pure  death 
process.  This  is  converted  to  a  discrete  model  in  which  opportunities 
for  inspection  occur  at  regular  points  in  time.  State  occupancy  costs 
as  well  as  maintenance,  inspection,  and  penalty  costs  are  included  In 
the  model. 

An  interesting  type  of  control  limit  rule  is  contained  in  Rosen- 
field  [1976A].  Let  be  the  cost  of  repair  from  the  jth  state  (in¬ 

cludes  penalty  cost),  let  I  be  the  constant  inspection  cost,  and  let 
A..  be  the  state  occupancy  cost.  In  each  time  period,  the  operator  may 
choose  to  inspect  the  component,  replace  the  component,  or  do  nothing. 
The  system  state  is  (i,k)  which  means  that  the  system  was  in  state  i 
at  its  last  inspection,  k  periods  ago.  This  leads  to  the  following  re¬ 
cursion. 

<J>(  l  ,k,a  ,N+1)  =  min  {inspect;  replace;  do  nothing) 


M  M 

rain  {I  +  l  pj^A  +  “  1  ,0,a,N)  ; 

j-0  J  3  j-0  J 


M  .  . 

I  p^JC  +  a<J»(M,0,a,N) ; 
J=0  1J  3 


M 


l  +  a<Ki , j  +  1  >a,N) }  for  i  >  0 

j-0  J  3 
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The  following  assumptions  are  made: 


(1) 

(2) 


C.,  A.,  and  C  -A.  are  nonincreasing  in  j. 

J  j  j  j 

>'*  „  pJ  .  is  nonincreasing  in  i  v  k,  and  >  0 

H-0  rii  °  ri0 


some  n  >  1. 


for 


(3)  P  is  upper  triangle,  i.e.,  p  =  0  v  j  <  i. 

(4)  P  is  totally  positive  of  order  2  (TP2),  i.e.,  PikPj£  >. 
v  i  >  j,  k  >  i. 


If  these  assumptions  hold,  Rosenfield  shows  that  a  4-region  policy  is 
optimal.  A  4-region  policy  is  one  in  which  for  fixed  i  and  increasing 
k,  it  is  optimal  to  first  do  nothing,  then  to  inspect,  then  to  again  do 
nothing,  and  finally  to  replace.  Figure  6.1  is  an  illustration  of  this 
type  of  policy. 


Number 

of 

Periods 

Since 

Last 

Inspec¬ 

tion 


Figure  6.1.  4-Region  Policy 


In  Rosenfield  [1976B]  it  is  shown  that  if  assumptions  (1),  (2),  and  a 
slightly  weaker  version  of  (3)  hold,  then  a  type  of  control  limit  rule 
is  optimal.  There  is  a  state  i*(k)  for  which  it  is  optimal  to  replace 
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if  i  <  i*(k)  and  to  either  inspect  or  do  nothing  if  i  >  i*(k).  With 
the  weaker  assumptions,  it  is  not  possible  to  decide  whether  to  inspect 
or  to  do  nothing  for  i  >  i*(k). 

The  previous  models  generally  assume  that  restoration  returns  the 
system  to  its  best  state.  A  model  constructed  by  Eppen  [1965]  allows 
the  system  to  return  to  any  higher  state  at  a  cost  C(k)  where  k  -  new 
state  -  old  state,  i.e.,  k  is  the  number  of  states  by  which  the  system 
is  improved.  The  following  assumptions  are  made: 


(1) 

(2) 


(3) 


C(k)  -  Ck,  C  >  0,  k  >  0. 

A^ ,  the  one  period  operating  cost  in  state  j,  is  convex  and 
positive  with  A^  -  A^  <  -C  «>  some  kind  of  mainte¬ 
nance  will  occur  in  state  0. 


pu  >  "•  "m-1  ■  1  -  'll"  Pl]  "  0  *  3  *  1-‘- 

Also,  p  is  concave  in  i  v  i  >  1. 


The  only  optimality  criterion  considered  is  minimized  discounted  cost 
for  a  finite  time  horizon.  The  standard  recursion  becomes: 


M 

d>(i,a,N+L)  -  min  {C(j-i)  +  A  +  a  V  P..<KJ,a,N)>  . 
j>i  1  j=0  ij 

The  optimal  policy  is  similar  to  a  control  limit  rule.  If  i  <  i*  at 
the  beginning  of  the  first  time  period,  then  the  system  should  be  re¬ 
paired  to  state  i*.  If  i  i*,  do  nothing.  It  is  also  shown  that 

i\>1  3nd  ^l^ 
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IC  oust  be  noted  that  even  if  a  control  limit  rule  is  the  optimal 
way  to  operate  a  multistate  component,  that  may  not  be  the  best  way  to 
run  a  system  composed  of  multistate  components.  This  is  shown  in 
Example  4  of  Denardo  [1967]. 


-126- 


7.  RESTORATION  WITH  THE  SYSTEM  INOPERABLE 


This  chapter  pertains  to  the  optimal  restoration  of  constantly 
monitored  equipment  with  the  system  inoperable.  In  Section  6.2  the 
components  were  periodically  inspected  which  allowed  the  system  to  be 
treated  as  a  discrete  time  Markov  decision  process.  In  this  chapter 
assumptions  are  made  which  allow  the  optimal  operation  of  a  multistate 
system  to  be  treated  as  a  continuous  time  Markov  decision  process.  A 
decision  is  made  in  each  state  to  either  continue  operation  or  to 
perform  a  restoration  activity.  It  is  shown  that,  when  restoration 
costs  are  constant,  the  restoration  process  should  always  return  the 
system  to  its  best  state.  Control  limit  rules  are  optimal  in  most 
cases. 

Section  7.1  extends  the  discrete  inspection  model  by  permitting  the 
restoration  activity  to  fail.  Thus,  it  may  require  more  than  one  time 
period  to  perform  the  Intended  restoration.  If  the  state  occupation 
costs  are  not  paid  while  the  system  is  being  restored,  a  control  limit 
rule  is  shown  to  be  optimal.  However,  if  the  state  occupation  costs  are 
paid  while  the  system  is  being  restored,  then  the  restoration  epoch  must 
be  shorter  than  the  exponential  holding  time  in  any  state  for  a  control 
limit  rule  to  be  optimal.  In  Section  7.2  the  continuous  time  model  for 
constantly  monitored  components  is  described.  Using  a  known  equivalence 
between  discrete  and  continuous  time  Markov  decision  processes,  it  is 
shown  that  this  problem  is  equivalent  either  to  the  discrete  time  model 
contained  in  Section  7.1  or,  with  different  assumptions,  to  the  basic 
model  discussed  in  Section  6.2.  This  equivalence  establishes  the 
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optimality  of  a  control  limit  rule  in  the  continuous  time  case.  Some 
examples  are  presented  in  Section  7.3.  The  last  section  of  Chapter  7  is 
devoted  to  showing  that  the  continuous  time  model  can  be  considered  as  a 
special  case  of  a  shock,  model.  The  shock  model  lias  some  generality  not 
contained  in  the  continuous  model,  and  its  optimal  operation  is 
described  by  a  control  limit  rule.  However,  the  shock  model  does  not 
incorporate  a  utility  function  into  its  cost  structure  nor  is  it  obvious 
how  that  would  be  accomplished .  Thus,  Section  7.2  is  necessary  to  show 
control  limit  rule  optimality  for  more  complicated  cost  functions. 

7.1  Failure  to  Replace  Model 

In  this  section  a  model  similar  to  the  one  in  Kolesar  [1966]  is 
analyzed.  The  difference  between  this  model  and  previous  models  of  this 
type  is  that  restoration  is  allowed  to  fail,  i.e.,  there  is  a  certain 
probability  that  the  attempted  system  restoration  will  not  be  completed 
in  a  single  period.  This  model  is  interesting  by  itself,  and  it  will 
turn  out  to  be  the  appropriate  generalization  when  a  continuous  model  is 
analyzed  using  discrete  methods.  The  main  result  is  that  a  control 
limit  rule  with  return  state  M  is  optimal  when  the  state  occupation 
costs  are  not  paid  during  the  restoration  process.  The  return  state  is 
defined  as  the  state  in  which  the  process  will  be  immediately  following 
a  successful  repair  or  replacement.  An  example  is  given  to  show  that  a 
control  limit  rule  is  not  necessarily  optimal  when  state  occupation 
costs  must  be  paid  during  restoration. 

Note ;  The  words  repair,  replace,  and  restore  are  used  interchangeably 
throughout  Part  II  of  this  thesis.  An  effort  has  been  made  to  use  the 
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word  restoration  when  talking  in  general  terms,  to  use  the  word  repair 
when  the  system  is  not  returned  to  its  best  state  or  continues  to 
operate  during  restoration,  and  to  use  the  word  replace  when  the  system 
is  returned  to  its  best  state  and  is  rendered  inoperable  by 
restoration.  The  words  system  and  component  are  also  used 
interchangeably.  The  results  herein  apply  to  any  process  with  state 
space  {0,1,..., M},  which  could  be  either  a  single  multistate  component 
or  a  system.  They  do  not  necessarily  apply  to  a  system  consisting  of 
raultistate  components. 

Let  C  >  0  be  the  cost  of  replacing  a  component  with  a  penalty  cost 
of  K  _>  0  for  replacement  from  state  0,  and  let  A^  be  the  one-period 
state  occupancy  costs  in  state  i  (U^  =  -A^  is  interpreted  as  the 
utility  in  state  i) .  Transitions  among  states  are  Markovian,  and  a 
is  the  discount  factor.  Replacement  must  always  be  attempted  from  state 
0.  Let  p  be  the  probability  that  a  planned  replacement  succeeds. 

It  is  assumed  that  if  a  failure  to  replace  occurs  in  any  period,  the 
probability  that  a  planned  replacement  succeeds  in  the  next  period  is 
still  p.  This  may  not  be  entirely  realistic,  but  it  is  necessary  for 
a  simple  model.  First  assume  that  state  occupancy  costs  are  not  paid 
during  replacement.  The  standard  dynamic  programming  recursion  is: 

n 

(J>(i,a,N+l)  «  min{A  +  a  £  p  ,.<!>(  j  .  a,N) ; 

1  j  =  0  1J 

C  +  ccpr<KM,a,N)  +  a(  1-jT)  (Ki  >  <*,N) }  for  i  >  0 
-  C  +  K  +  ap4»(M,a,N)  +  a(l-p)<K0,a,N)  for  i  -  0.  (7.1) 
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Equation  (7.1)  is  valid  when  replacement  cost  is  paid  even  though  the 
attempted  replacement  fails.  If  replacement  cost  is  paid  only  when  the 
replacement  is  successful,  the  second  term  in  the  minimization  would 
become  pC  +  ap4<(M,a,N)  +  a(  1— p )  i  ,  a  ,N)  .  The  analysis  will  be 
identical  regardless  of  what  option  is  selected. 

The  following  assumptions  are  necessary. 


(1) 

A. 

l 

is  nonincreasing  in  i. 

(2) 

vk 

zj=0 

pij 

is  nonincreasing  in 

(3) 

(  n ) 
pi0 

>  0 

for  some  n  >  1. 

From  condition  (2)  it  can  be  shown  (see  Derman  [1970],  page  123)  that 
is  nonincreasing  in  i  for  every  nonincreasing  function 
f(«).  It  can  also  be  shown  that  a  random  variable  with  density 
P(X^  =  j)  =  P^  has  an  IKK  distribution.  The  third  assumption  ensures 
that  replacement  will  eventually  occur;  thus,  system  operation  is  a 
regenerative  process.  As  discussed  in  Section  6.1,  only  policies  which 
are  deterministic  and  which  do  not  depend  on  the  past  history  of  the 
process  (class  C^)  need  be  considered.  The  inductive  argument 
contained  in  Theorem  7.1  will  often  be  used  to  prove  theorems  concerning 
control  limit  rules  and  restoration  to  state  M. 


Theorem  7.1:  If  conditions  (1),  (2),  and  (3)  hold  and  total  discounted 
cost  is  the  optimality  criterion,  then  state  M  is  the  optimal  return 

state. 


Proof :  Since  restoration  to  any  state  i.;  fllowed.  Equation  (7.1) 

becomes 
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for  i  >  0 


4<(  i  ,a ,N+1 )  »  min{Ai  +  a  ^=Q  p  <l»(j  ,a,N)  ; 

C  +  apmin [  <ji(  Jl  ,a  ,N)  ]  +  a(  1— p ) c^( i  ,a,N)  } 

JUS 

m  C  +  K  +  apmin[^(A,a,N)  ]  +  a(  1-p)  4,(0  ,  a  ,N)  for  i  >  0  . 

ieS  ~ 

4,(1, a, 0)  =  min{Ai,C}  is  clearly  nonincreasing  in  i,  and  it  will  be 
shown  that  is  true  for  every  N.  Assume  inductively  that  4,(1, a, n)  is 

rti 

nonincreasing  in  i  v-  n  =  0,  1 ,  . . . ,  N.  Then  ij=0  Pij‘Kj*a»N)  is 
nonincreasing  in  i  from  assumption  (2)  and  A^  is  nonincreasing  in  i 
from  assumption  (1).  Thus,  every  term  in  the  above  minimization  is 
nonincreasing  in  i,  so  4>(i,a,N+l)  is  nonincreasing  in  i.  By 
induction,  <J>(i,a,N)  is  nonincreasing  in  i  v  N.  Then 

4>(i,a)  =  lim  4>(i,a,N)  is  nonincreasing  in  i  (see  Derman  [1970],  page 

37,  for  a  proof).  Thus,  the  optimal  return  state  is  given  by 

<l(M,cc)  =  min[(lU,a)  ] .  3 
US 

The  proof  of  Theorem  7,1  applies  anytime  (j»(i,a,N)  is  nonincreasing 
in  i  for  every  N.  This  will  be  the  case  in  3ll  the  theorems  pertain¬ 
ing  to  control  limit  rule  optimality.  To  avoid  unnecessary  repetition, 
whenever  a  theorem  states  that  state  M  is  the  optimal  return  state 
using  the  discounted  cost  criterion.  Theorem  7.1  will  be  referenced. 

For  simplicity,  will  replace  min[4>(Jl,a) !  in  the  proofs  these 

it  S 

theorems.  The  same  remarks  apply  to  Theorem  7.3  when  the  average  cost 
criterion  is  being  used  in  place  of  the  discounted  cost  criterion. 

Lemma  7.2  is  a  well  known  Abelian  theorem  (see  Derman  [1970],  page  144). 
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Lemma  7.2:  Let  {a  ,n“0 , 1 , . . . )  be  a  sequence  of  real  numbers,  and  let 

f(X)  =  anXn,  0  <  X  <  1.  If  lim  ( 1/N)  I^=0  an  =  A  <  ",  then 

lim(l-X)f(X)  =  A. 

X+l 


Let  W  be  the  (finite)  expected  reward  earned  in  period  n  when 
n 

using  policy  R  with  initial  state  i.  Then 

*R(i)  =  1/(^0  to  Wn  ^  =  Co  a\*  ^tUng 

X  =*  a  and  a  =  W  in  Lemma  7.2,  $  (i)  =  lim(l-a)ii»  (i,a). 

n  n  R  a+i  R 

Theorem  7.3:  If  conditions  (1),  (2),  and  (3)  hold,  and  expected  average 
cost  is  the  optimality  criterion,  then  state  M  is  the  optimal  return 
state. 

Proof:  From  Theorem  7.1  the  policy  R(  a)  which  minimizes  <p(i,a)  must 

have  return  state  M.  Since  S  and  A  are  finite  and  only 

non-randomized  policies  need  be  considered,  there  are  a  finite  number  of 

policies.  Thus,  there  must  be  a  policy  R*  which  has  return  state  M 

and  a  sequence  of  discount  factors  (a  ,n=l,2,...}  such  that  lim  a  =  1 

n  j-j  -y  co  n 

and  R(a^)  =  R(a^)  =  •••  =  R*  is  the  optimal  policy  for  each  a^.  For 

any  other  policy  R  and  any  a  ,  (l~a  )  <b  (i,a  )  >  (1-a  )  <J»  (i  a  ). 

n  nR  n  —  n  R*  n 

From  Lemma  7.2, 

♦B(i)  =  lira(l-a  H .  (i,a  )  >  lim(l-a  )<k.  .(i,a  )  =  4>  (i). 

k  j^^co  n  k  n  ^  4cd  n  k.  n 

Thus,  R*  is  optimal.  □ 

Normally,  R*  will  be  optimal  for  all  sufficiently  small  interest 
rates  as  well  as  being  optimal  in  the  average  cost  case. 
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Theorem  7.4:  If  conditions  (1),  (2),  and  (3)  hold,  there  is  a  control 


limit  policy  which  minimizes  <j>(i,a). 

Proof :  From  Theorem  7.1,  <i(i,a)  is  nonincreasing  in  i  and  can  be 
written 

M 

(Ki,a)  -  min{Ai  +  a  p^iKj.a); 

C  +  ap<l>(M,a)  +  a(  l-pH(i  ,a)  for  i  >  0 

=  C  +  K  +  ap4;(M,a)  +  ( l-pH(0,ct)  for  i  ■  0 

Assume  there  exists  an  i*  >  0  such  that  replacement  is  better  than 
inaction  (if  not,  replacing  only  in  state  0  is  trivially  a  control 
limit  rule).  Then: 


<Ki*,at) 


C  +  ap^(M,a)  +  a(  l-pH(i*,a)  <  A^ 


M 

+  a  l  P4*>(j  ><*) 
J-0  1  3 


Solving  for  <J>(i*,a)  yields: 

<i)(i*,a)  -  [C+apcKM,a)]/[l-a(l-p)] 

Since  4i(i*,a)  is  independent  of  i*,  set  4>C i , at)  «•  4>(i*,a)  v  i  i*. 
Since  4>(i,a)  is  nonincreasing  in  i,  this  must  be  the  optimal  policy. 
Thus,  it  is  optimal  to  replace  v  i  £  i*  which  is  a  control  limit 
rule.  0 


Theorem  7.5:  If  conditions  (1),  (2),  and  (3)  hold,  there  is  a  control 
limit  rule  which  minimizes  <> ( i ) . 
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Proof :  From  Theorem  7.4  there  is  a  control  limit  policy  R(  a)  which 
minimizes  4>(i,a).  Since  there  are  a  finite  number  of  control  limit 
policies,  there  must  be  a  control  limit  policy  R*  and  a  sequence  of 
discount  factors  {a  ,n«0,l,...}  such  that  lim  a  «  l  and 


n 

R(Oj)  =  RU?)  -  -  R* 

other  policy  R  and  any 
Lemma  7.2, 


nxx>  n 

is  the  optimal  policy  for  each  a For  any 

a  ,  (1-a  H  (i,a  )  >  (1-a  )<L  .(i,a  ).  From 
n  nR  n—  n  R*  n 


V1’ 


lim(l-a  H  (i,a  )  >  lim(  1-a  )  (k  (i ,  a  )  =  <}>  (i ) 
n  k  n  —  n  -xx)  n  n 


Thus,  R*  is  optimal,  u 


The  preceding  theorems  have  shown  that  a  control  limit  policy  is 
optimal  when  state  occupancy  costs  are  not  paid  during  replacement.  If 
those  costs  are  paid  during  replacement,  the  standard  recursion  becomes 


4>(i,a)  =  min  (A^  +  a  p 

C  +  A^  +  ap<KM,at)  +  a(l-p)  iK  i ,  a) }  for  i  >  0 

=  C  +  K  +  Aq  +  ap4>(M,a)  +  a(l-p)(K0,a)  for  i  =  0 


If  <|>(i*,a)  is  determined  as  in  Theorem  7.4,  then 
<Ki*,a)  =  [C+A^+aptKM.a)  ]/[l-a(l-p)  ] 

This  is  no  longer  independent  of  i*  because  of  the  A  term  on  the 

i* 

right  hand  side,  and  the  analysis  used  in  Theorem  7.4  fails.  The 
following  example  shows  that  a  control  limit  rule  may  not  be  optimal  in 
this  case. 
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Example  7.1:  Consider  the  4-state  DTMC  shown  in  Figure  7.1. 


4»(3,a)  =  A3  +  ap32<K2,a)  +  ap3i<J»(l,a) 

<K2,a)  =  minO^  +  C  +  A2  +  ap<|)(3,a)  +  a(l-p)  <J^2 ,  a) } 

^(l, a)  =  minfAj  +  a(K0,a);  C  +  A^  +  ap^(3,a)  +  a(l-p)  1 ,  a) } 

<l>(0,a)  =  C  +  AQ  +  apci)(3 ,  a)  +  a(l-p)  <K0 ,  a) 

4»(  3  ,  a)  *  -12.38 

4i(2,a)  =  min {-9. 37;  -10.79}  »  -10.79  (replace) 

<Kl,a)  =*  min{l. 27;  2.22}  =»  1.27  (do  nothing) 

4>(0 ,  a)  =  2.54. 

Thus,  it  is  optimal  to  do  nothing  in  states  1  and  3  and  to  replace  from 
states  0  and  2  which  is  not  a  control  limit  rule.  □ 

In  Example  7.1  it  is  the  high  discount  rate  (1-a  =  .5)  which 
causes  the  non-optimality  of  control  limit  rules.  It  can  be  shown,  for 
a  =  .9,  that  it  is  optimal  to  replace  in  states  0,1,  and  2  which  is  a 
control  limit  rule.  Thus,  there  is  still  hope  that  a  control  limit  rule 
is  optimal  in  the  average  cost  case.  Unfortunately,  that  is  not  true 


either  as  shown  by  Example  7.2  in  Section  7.3. 


'.2.  Equivalence  of  the  Continuous  and  Discrete  Models 

The  subject  of  this  section  is  the  equivalence  between  a 
continuous  time  Markov  decision  process  (CTMDP)  and  a  discrete  time 
Markov  decision  process  (DTMDP).  By  equivalence  it  is  meant  that  the 
two  processes  have  identical  optimal  policies  and  identical  minimum 
cost.  This  equivalence  is  due  to  Serfozo  [1979].  It  is  useful  since  it 
allows  a  constantly  monitored  system  to  be  treated  as  a  periodically 
inspected  system. 

Let  X  «  (S,A,r,p,a)  be  a  DTMDP,  and  let  Y  =*  (S,A,  r,X,  p,p)  be 

a  CTMDP.  The  notation  presented  in  Section  6.1  is  used  throughout  this 

Chapter.  Let  y  -  sup  X  (a).  An  equivalence  between  X  and  Y  is 
i.a  1 

given  by  Theorem  7.6. 


Theorem  7.6  (Serfozo  1979]):  Let 
with  y  <  »  and  countable  S  and 
with 


Y  -  (S,A,  r,X,  p,0)  be  a  CTMDP 
A.  Let  X  =  (S,A,r,p,a)  be  a  DTMDP 


rt(a) 

‘V*' 


y/ (y+P) 

j  rt(a)[p+Xt(a)]/(p+y) 

(  ri(a)X1(a)/y 

(  Xi(a)  Pjj (a)/y 
l 1  -  Xi(a)ll-  plt(a)]/y 


for  discounted  rewards 
for  average  rewards 

if  j  +  i 
if  j  -  i  . 


If  Y  and  X  are  both  controlled  by  stationary  policy  R,  then 
WR(i,p)  -  <|/R(i,a)  and  VR(i)  -  y4>R(i). 
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Remarks : 


(1)  The  requirement  that  S  and  A  be  countable  is  necessary  only 
to  avoid  technical  details.  They  are  finite  for  the  cases  considered 
herein. 

(2)  The  condition  y  <  <*>  means  that  the  exponential  holding  time 
parameters  must  be  uniformly  bounded  which  is  equivalent  to  saying  that 
the  Markov  process  is  regular.  This  is  trivially  satisfied  in  this 
thesis  since  S  is  finite. 

(3)  For  p  =•  0  (a=l),  the  rewards  in  the  discounted  case  become  the 
rewards  in  the  average  value  case.  Because  of  this,  only  the  discounted 
rewards  will  be  expressly  written  when  applying  Theorem  7.6. 

(4)  Since  both  Y  and  X  have  the  same  state  and  action  spaces, 
the  same  policies  apply  to  both.  Only  stationary  policies  need  be 
considered,  and  there  are  a  finite  number  of  those.  Theorem  7.6  says 
that  if  a  stationary  policy  is  optimal  for  one  system,  then  the  same 
stationary  policy  is  optimal  in  the  other  system.  Thus,  if  a  CTMDP  is 
converted  to  a  DTMDP  and  something  is  proven  regarding  the  optimal 
policy  for  the  DTMDP,  then  that  is  also  true  of  the  optimal  policy  for 
the  CTMDP. 

(5)  The  rewards  for  the  CTMDP  in  Theorem  7.6  are  lump  rewards 
rather  than  reward  rates.  If  r(a)  are  reward  rates,  the  appropriate 
DTMDP  rewards  are  r^Ca)  =  r^aJ/CfS+y) . 

Consider  a  constantly  monitored  system  with  S  ■  {0,1,..., M}  and 
A  -  {0  -  do  nothing,  1  »  repair}.  Let  C  ^  0  be  the  replacement  cost 
rate  with  a  penalty  cost  rate  K  >  0  for  replacement  from  state  0,  and 
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let  A.  be  the  state  occupancy  cost  rate.  Let  A  =  A  be  the 
1  lM 

replacement  rate  from  all  states.  Normally  replacement  takes  a 
specified  time  T  in  which  case  A  =  1/T.  This  won't  affect  the 
results  since  they  are  in  terms  of  expected  values.  Let  A  ,  j  <  i,  be 
the  transition  rates  when  replacement  is  not  transpiring.  Now  consider 
a  DTMDP  with  S  and  A  as  above,  and 


Pij(0) 


if  jM 

1  -  Ai#/y  if  j  =*  i 


V‘> 


0  if  j  i  i.M 

A/y  if  j  -  M 

1  -  A/y  if  j  =  i 


r^O)  =  A1/(&+y) 


( C/(P+y)  for  i  >  0 

r  U)  =  } 

{  (C+K)/ ( 0+y)  for  i  *  0 


The  rewards  r^(l)  were  calculated  assuming  that  state  occupancy  costs 
are  not  collected  during  replacement.  If  those  costs  are  collected 
during  replacement, 


ri(1) 


C/Y  +  A./y 


for  i  >  0 


(C  +  K)/y  +  A/y  for  i  =  0  . 


It  is  always  true  that  pij^^  is  nonincrea8ing  in  i  since 

transitions  occur  only  when  j  “  i  and  j  *  M.  Obviously,  one 
condition  which  ensures  that  Py(0)  is  nonincreasing  in  i  *  k 

is  l)mQ  Ay  nonincreasing  in  i  -v  k.  This  is  very  restrictive. 
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however.  One  of  the  implications  of  this  condition  is  that,  with 
k  ■  M,  l£0  Xjy .  A  more  natural  condition  which  ensures  that 

Ij=g  P^CO)  is  nonincreasing  in  i  -v  k  is  given  in  Lemma  7.7. 

Lemma  7.7:  If  *  j  <.  1  <  *.  then  Ij»0  PyCO)  is 

nonincreasing  in  i  v  k. 

Proof :  Consider  any  Xj.0  p^CO)  and  f^mQ  p^lO)  with  i  >  i. 

If  k  ^  i,  then  Ij.g  P^j (0)  **  1,  and  the  result  holds  trivially.  If 
k  <  1,  then 

Ij»o  a  £j=o  Xjy/Y^  ^j-o  Xij/Y  *  ^j-o  pij (0) *  D 

The  following  assumptions,  which  are  the  continuous  equivalents  of 
the  assumptions  in  Section  7.1,  are  necessary. 

(1)  is  nonincreasing  in  i. 

(2)  l  X.^  is  nonincreasing  in  i  »  k,  or 

hj  >  -  i  <  i  <  *• 

(3)  X,  ,  X  X 

i,i^  ipi2  in  q  >  0  for  80me  sequence 

i^,  ....  i^.  This  assumption  simply  assures  that  state  0  will 
eventually  be  reached  if  no  replacement  is  performed. 

Theorem  7.8:  If  (1),  (2),  and  (3)  hold  and  costs  A^  are  not  paid 
during  replacement,  then  a  control  limit  rule  with  return  state  M  is 
optimal  for  both  the  discounted  and  average  cost  criteria. 

Proof:  In  the  discounted  case,  the  recursion  is 
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i 

<J»(i,a,fH-l)  -  min{A  /(P+y)  +  a  J  p  , .  ( 0) 4>(  j  ,a,M) ; 

1  j-0  ij 

C/(pty)  +  aX<J)(M , a,N)/ y  +  a(l-X/y)<J;(i,ot,N) }  for  i  >  0 

-  (C+K)/ ( p+y)  +  a\(KM,a,N)/y  +  a(l-X/y)<K0,a,N)  for  i  =  0. 

With  p  “  X/y,  Theorems  7.1,  7.3,  7.4,  and  7.5  of  Section  7.1  are 
applicable.  The  limiting  argument  is  still  valid  since  the  average 
rewards  are  the  limi  t  as  a  ■*>  1  ( (3>0)  of  the  discounted  rewards.  Q 

Theorem  7.9:  If  (1),  (2),  and  (3)  hold,  if  costs  are  paid  during 

replacement,  and  if  X  X  *  i,  then  a  control  limit  rule  with 

return  state  M  is  optimal  for  both  the  discounted  and  average  cost 
criteria. 

Proof :  If  X  2  v  i,  then  y  =  sup  X^a)  =  X,  and  P^O)  =  1 

i ,  a 

v  i.  The  recursion  in  the  discounted  case  then  becomes 

i 

<|>(i,a,N+l)  “  min{A  /(f3+y)  +  a  £  p  (0)i^( j  , a,M)  ; 

j-0 

C/(  P+y)  +  A  /(pfy)  +  a<l>(M,a,N)}  for  i  >  0 

-  (C+K)/(P+y)  +  Aq/ (f5+y)  +  a<J;(M,<x,N)  for  i  =*  0  . 

Since  there  is  no  possibility  of  replacement  failure,  this  recursion  has 
the  same  form  as  the  recursion  in  Kolesar  [1966).  The  result  follows 
from  Theorem  l  of  that  paper  and  Theorem  7.1  of  Section  7.1.  0 
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7.3.  Examples 

In  Theorem  7.9  it  was  necessary  to  assume  X  X  -v  i,  i.e.  , 
the  replacement  rate  is  larger  than  the  sum  of  the  failure  rates  from 
any  state.  This  is  a  reasonable  assumption  since  most  components  will 
be  quickly  replaced;  otherwise,  their  availability  could  be  very  poor. 
However,  if  X  <  X  for  some  i  and  state  occupancy  costs  are  paid 
during  replacement,  then  a  control  limit  rule  is  not  necessarily  optimal 
as  shown  by  Example  7.2. 


Example  7.2:  Consider  the  4-state  CTMC  shown  in  Figure  7.2. 


Let  K 


•  1,  X, 


.9,  X, 


4.5,  Xn 


.5,  X, 


.5, 


32  ’31  21  ’20  10 

X  =  2  (or  T  =  .5),  C  =  1.5,  K  -  0,  AQ  -  0,  -  -1,  A2  =  -2,  and 

A^  =  -2.01.  Average  expected  cost  is  the  optimality  criterion.  From 

the  theory  of  regenerative  processes,  average  expected  cost  can  be 

calculated  from 


E(average  cost)  »  E(cost  per  cycle)/E(cycle  length). 


State  3  is  considered  as  the  return  state.  Replacement  may  occur  from 
the  following  groups  of  states: 

{3, 2, 1,0),  {2,1,0},  {2,0},  {1,0},  and  {0}. 
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Calculations  are  shown  below.  The  results  appear  In  Table  7.1.  Let 


E(cost)  =  E(cos t  per  cycle),  and  E(cycle)  =  E(cycle  length), 


(1)  {3,2, 1,0)  E(cost)  -  C  +  A^t  -  .495 


E(cycle)  -  T  -  .5 


(2)  {2,1,0}  E(cost)  -  C  +  -r — -  A.T 

*32+*3 1  *32+*31  2 


*32+*31  AjT  "  _1'06 


E(cycle)  -  T  +  l/(\32+\31)  =  1.5 


(3)  {2,0}  E(cost) 


C  +  X — TT~  ~ 
X32  X31 


32 

*32+Sl 


A2T 


K,  A 

-  U— +  V) 


*31+*31  *10 


=  -2.41 


E(cycle)  -  T  +  -r——  +  J-~T-  (vM  =  3.3 
*32+*3 1  3  2+*3 1  *10 


(4)  {1,0}  E(cost)  -  C  +  r-4r~  +  \~TX — 

*32  *31  *32  *31  *21  *20 


+  _ - —  A,  T  +  20 


*21+*20  1  *2 1+*20  ° 


AoT] 


31 


*32+*31  1 


A.T  -  -1.045 


-142- 


(5)  {0} 


E( cycle) 


E(cost) 


E(cycle) 


1 

f  1  1 

X32+X31 

+ 

X32+X31 

l  \  +X  ' 
*21*20 

A3 

X32 

r  A2 

X32+X31 

x32+X3i 

*2i+*20 

+  X£1 

A, 

(  1  n 

X2i+x20 

U  '  J 
*10 

.  X31 

A1 

j.  a 

T  =  -2.53 

\  +x  xx  j  T  nn 
X32  X31  *10  ° 

1 

+  X32 

r  1 

*32+X31 

*32+*31 

"X21+*20 

X21 

r  1  yi 

X21+X20 

It  !  ‘ 

*10 

1.52 


+  _ £i_  f  _L_)  =  35 

\  +x  v x  '  J • D 
*32  *31  *10 


Set  of  States 

{3, 2, 1,0) 

(2,1,0) 

(2,0) 

(1,0) 

(0> 

E(cost)/E(cycle) 

.99 

-.71 

-.73 

-.69 

-.72 

Table  7.1  Cost  Comparison 


The  minimum  expected  average  cost  is  achieved  by  inaction  in  states  1 

and  3  and  replacement  in  states  0  and  2.  This  is  clearly  not  a 

control  limit  rule.  It  can  be  shown  that,  for  \  >  5  -  max  ,  a 

i  i* 

control  limit  rule  is  optimal.  The  results  of  this  example  are  also 
valid  for  the  discounted  case  with  a  small  discount  rate.  C 


3- 


It  may  seem  unreasonable  that  the  cost  of  replacing  a  component  is 
constant  except  for  a  penalty  cost  when  replacing  from  state  0.  The 
component  might  have  a  salvage  value  so  that  replacement  cost  C,  is 
nonincreasing  in  i.  Unfortunately,  this  does  not  necessarily  lead  to  a 
control  limit  rule  as  shown  in  Example  7.3. 

Example  7.3:  Consider  the  DTMC  shown  in  Figure  7.3. 


Let  C3  ”  C2  =  C*  and  let  C^  =  C^  =  C  +  K  with  C  =  1  and  K  =  10. 
This  might  occur  if  states  1  and  0  were  very  undesirable.  Let 

'i  '  0  ’  P33  '  -9'  P32  '  •09’  P3 L  ‘  -0'-  p2 1  *  '5'  P20  ‘  ’5'  anp 

P^q  =  1.  Compute  E(average  cost)  =  E(cos t) /E(cyc le)  as  in  Example  7.2 

with  state  3  as  the  return  state.  Possible  groups  of  states  to 
replace  from  are  {3, 2, 1,0},  {2,1,0},  {2,0},  {1,0},  and  {0}.  Calcula¬ 
tions  are  shown  below,  and  the  results  are  contained  in  Table  7.2. 

(1)  {3, 2, 1,0}  E(cost)  =  C  =  1 

E(cycle)  =  1 


(2) 

{2,1,0} 

E(cost) 

=  c  +  p31k=  1.1 

E(cycle) 

=  1  +  p31  +  P32  -  1.1 

(3) 

{2,0} 

E<C08t) 

-  c  +  p31k=  1.1 

E(cycle) 

"  1  +  2p31+P32“  K11 
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(4)  {1,0} 


E(cost)  =  C  +  (p31  +  P32)k  =  2 
E( cycle)  =  1  +  p31  +  2p32  =  1.19 
(5)  {0}  E(cost)  =  C  +  (p31  +  p32>K  =  2 

E(cycle)  =  1  +  2p31  +  P32(2p20  +  3p2i^  =  1,245 


Set  of  States 

(3, 2, 1,0} 

{2,1,0} 

I 

{2,0} 

{1,0} 

{0} 

E(cost)/E(cycle) 

1 

1 

.99 

1.68 

1.6 

Table  7.2  Cost  Comparison 

The  only  policy  which  yields  an  average  cost  less  than  one  is 
inaction  in  states  1  and  3  and  replacement  in  states  0  and  2. 
Thus,  a  control  limit  rule  is  not  optimal.  H 

The  main  reason  a  control  limit  rule  is  not  optimal  in  Example  7.3 
is  that  once  the  system  is  in  a  state  in  which  it  will  have  to  pay  the 
penalty  cost,  it  must  be  optimal  to  try  and  stretch  out  the  cycle 
length.  Average  cost  is  minimized  by  allowing  the  system  to  operate  as 
long  as  possible  before  replacement.  In  Example  7.4  the  system  is 
profitable  to  run.  The  objective  is  to  maximize  utility  instead  of 
minimizing  cost.  The  replacement  cost  is  constant,  but  the  replacement 
rate,  \  ,  is  increasing  in  i.  This  might  transpire  if  a  repairman 

was  always  at  the  site  to  fix  the  system.  Then  there  would  be  no 
additional  cost  involved  in  restoring  the  system,  but  there  might  be 
additional  time  involved  for  an  increasingly  degraded  system.  A  control 
limit  rule  is  not  necessarily  optimal  in  this  case  either. 
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Example  7.4:  Consider  the  4-state  CTMC  in  Figure  7.4  which  is  identical 
to  the  system  in  Example  7.1. 


Figure  7.4  CTMC 


Let  C  »  2,  K  =  0,  U  =  i  v  i  (linear  utility  function),  =  •!» 

X31  =  *9,  X21  =  ’5*  X10  ’  *5,  X20  =  *5,  X33  X23  =  1  (T23  =  ° ’ 

X.  *  .02  (T  .  =  50),  and  X  *  .01  (T  =  100).  The  optimality 
13  1j  03  03 

criterion  is  again  expected  average  value,  but  this  time  the  objective 
is  to  maximize  utility.  It  is  assumed  that  utility  is  not  collected 
during  restoration.  Possible  groups  of  states  to  replace  from  are 
{3,2,1 ,0},  {2,1,0},  (2,0),  ll,0},  and  {0}.  EUverage  utility) 

=  E(util) /E(cycle)  where  E(util)  5  Efutility  per  cycle)  and 
E(cycle)  =  E(cycle  length). 

Calculations  are  shown  below,  and  the  results  are  in  Table  7.3. 

(1)  {3, 2, 1,0}  E(util)  =  -  C  =  -2 

E(cycle)  =  1 / X ^  =  i 

(2)  {2,1,0}  E(util)  =  U3/(\32+X3l)  -  C  =  1 


E(cycle)  =  y — ^ —  +  t- — ^ —  T23 
X32  X31  X32  X3I  23 


31 


vsr  13 


T,  ,  **  46. 1 
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(3)  {2,0}  E(utll) 


3  +  Si  ,U1 


\  +X  '  \  +X  S  )  -  C  =  3.8 
X32  Si  X32  Si  X10 


E(cycle)  -  +  y-|£-  T 


32  31  32  31 


23 


+  rjx  (t~  +  t03)  =  92-9 

32  11  So  03 


(4)  {1,0}  E(util) 


32 


X  -i  ^ 


'32 


X32+Sl  X21+X20 


)  -  C  -  1.2 


E(cycle) 


32 


21 


X32+X31 


X31+X32  X21+X20 


X2I+X20  13 


+OTI-»,+Ar^-98-6 


(5)  {0}  E(util)  - 


U 


,  S->  U7  X  U, 

3  L  32  r  2  .  21  r  1  11 

+  — — rr~  L\— rr—  +  3 — n —  S — )J 


X32+X3l  X32+X31  X21+x20  X2  1+X20  X1 0 


X,.  U. 

+  r~rr-  hr-)  -  c  =  UA 
12  11  So 


E(cycle)  = 


32  c  1 


X32+X31  X32+X31  LX21+X20 


+ _ — _  (_!_)] 

Si+So  So 


31  f  1 


X32+X31  ^X10 


(iT-)  +  T03  =  103 
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Set  of  States 

{3,2,1 ,0} 

(2, 1 ,0) 

(2,0) 

(1,0) 

(0) 

E(util)/E(cycle) 

-2 

.0217 

.0409 

.0122 

.0398 

Table  7.3  Cost  Comparison 

Maximum  expected  utility  is  achieved  by  inaction  in  states  1  and  3 
and  replacement  in  states  0  and  2.  Again,  a  control  limit  rule  is 
not  optimal.  Q 

7.4.  Equivalence  of  the  Continuous  and  Shock  Models 

Shock  models  are  models  in  which  a  random  amount  of  damage  occurs  at 
a  random  point  in  time.  The  times  when  damage  occurs  are  called  shocks. 
The  damage  to  the  component  is  cumulative  and  eventually  causes  compo¬ 
nent  failure.  Shock  models  are  of  interest  in  reliability  since  the 
lifetime  of  a  component  in  such  a  model  is  governed  by  one  of  the 
distributions  discussed  in  Section  1.1.  The  appropriate  life  distri¬ 
bution  naturally  depends  upon  assumptions  about  shock  interarrival  times 
and  the  damage  accumulation  process.  Optimal  maintenance  policies  for 
shock  models  can  also  be  considered.  Shock  models  mimic  Derman’s  model 
in  that  the  component  can  be  replaced  at  a  cost  C  0  and  a  penalty 
K  >  0  is  levied  if  failure  occurs.  They  have  not  been  extended  to 
include  state  occupation  costs  or  variable  maintenance  costs.  Since  it 
is  assumed  that  the  state  of  the  system  is  always  known,  these  models 
apply  to  constantly  monitored  systems.  A  control  limit  rule  in  this 
setting  means  that  the  component  will  be  replaced  either  upon  failure  or 
after  accumulating  a  certain  amount  of  damage. 
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The  first  shock  model  which  will  be  useful  in  this  thesis  is  Bergman 
[1978].  Let  (X(t),  t  ^  0}  be  a  decreasing  stochastic  process  with 
state  space  (0,1,..., M}.  There  is  also  a  nonincreasing  failure  rate 
function  v(X(t))  for  which  P(T  >  t|X(s),  s  ^  0) 

»  exp  (-  Jq  v(X(s))ds  where  T  is  the  component  lifetime.  The 
failure  rate  function  is  the  rate  at  which  the  process  goes  from  a 
positive  state  to  state  0.  Bergman  shows  that  if  this  rate  is 
nonincreasing  in  X(t),  then  a  control  limit  rule  is  optimal.  Another 
model  of  this  type  is  Gottlieb  [1982].  In  this  paper  failure  can  only 
occur  when  a  shock  arrives.  The  probability  that  a  component  in  state 
j  will  survive  the  next  shock  is  denoted  R(j)  and  is  assumed  to  be 
nonincreasing  in  j .  If  the  sojourn  time  (time  before  the  next  shock) 
in  state  j  is  nonincreasing  in  j,  then  a  control  limit  rule  is 
optimal.  A  later  paper  will  show  that  this  result  remains  valid  when 
partial  repair  of  the  component  between  shocks  is  permitted. 

A  CTMDP  can  be  considered  as  a  special  case  of  a  shock  model  when 


shock  interarrival  times  are  exponential  with  parameter  ar>d  the 

damage  distribution  is  p  ■  Decisions  are  made  after  a 

shock  occurs.  From  Bergman  [1978],  if  \  is  increasing  in  j,  a 
control  limit  rule  is  optimal.  This  result  weakens  the  hypothesis  that 
X  be  nonincreasing  in  i  *  k,  and  it  also  weakens  the 
discrete  hypothesis  that  Ij.g  Pjj  nonincreasing  in  i  *  k. 

This  change  makes  sense  because  the  objective  is  to  minimize  restoration 


cost,  and  the  only  additional  cost  is  incurred  when  the  process  is  in 
state  0.  The  result  from  an  extension  of  Gottlieb  [1982]  is  that 


control  limit  rules  are  still  optimal  when  transition  rates 


with 
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j  >  i  are  permitted.  Unfortunately,  the  shock  models  do  not 
incorporate  utility  functions  nor  is  it  obvious  how  that  would  be 
accomplished.  Thus,  Section  7.2  is  necessary  to  establish  the 
optimality  of  control  Limit  rules  in  the  CTMDP  with  state  occupancy 
costs. 
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8.  REPAIR  DURING  SYSTEM  OPERATION 


The  cost  and  risk  associated  with  system  failure  makes  highly 
reliable  systems  very  desirable.  One  way  to  achieve  high  reliability 
without  sacrificing  availability  is  to  allow  multistate  comj/onents  or 
systems  to  continue  operation  during  repair.  Only  one  model  addressing 
this  issue  has  been  found  in  the  literature  (Smith  [1978]).  It  is 
assumed  throughout  this  chapter  that  the  failure  and  repair  rates  of  a 
component  are  constant  in  time.  This  assumption  allows  the  use  of 
continuous  time  Markov  chains  in  modeling  the  system. 

In  Section  8.1  a  model  identical  to  the  model  in  Section  7.2,  except 
that  repair  occurs  during  system  operation,  is  considered.  It  is  shown 
that  a  control  limit  rule  is  optimal.  The  remainder  of  the  chapter  is 
devoted  to  the  following  problem.  Assume  that  a  component  is  in  state 
J  and  a  decision  must  be  made  to  repair  either  to  state  J  +  I  or  to 
state  J  +  H  where  H  >  I.  This  is  similar  to  determining  whether  it 
is  better  to  repair  all  at  once  or  in  stages  when  failures  may  occur 
during  the  repair  process.  When  certain  assumptions  are  valid,  it  is 
optimal  to  repair  to  state  J  +  H,  the  better  state.  These  assumptions 
are  (1)  an  increasing  utility  function,  (2)  the  failure  or  repair  rate 
into  a  state  increases  as  the  distance  to  the  current  state  decreases, 
and  (3)  the  repair  rate  from  J  to  J  +  H  must  be  at  least  as  large  as 
the  repair  rate  from  J  to  J  +  I.  Examples  are  given  to  show  that  it 
is  not  necessarily  optimal  to  repair  to  the  better  state  when  any  of 
these  assumptions  fail  to  hold.  Surprisingly,  it  is  not  necessarily 
optimal  to  repair  in  stages  even  if  the  mean  repair  time  when 
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repairing  in  stages  is  shorter  than  the  mean  repair  time  when  repairing 
all  at  once.  However,  it  may  be  better  to  repair  in  stages  even  if  the 
mean  repair  time  is  longer  than  when  repairing  alL  at  once  because  of 
the  increased  utility  associated  with  higher  states. 


8.1.  Description  of  the  Model 

The  model  considered  in  this  section  is  identical  to  the  model 
considered  in  the  previous  chapter  except  that  repair  occurs  during 
system  operation.  This  means  that  failures  may  occur  during  the  repair 
process.  Let  C  >  0  be  the  repair  cost  rate  with  a  penalty  of  K  >  0 
assessed  for  repair  from  state  0,  and  let  be  the  occupancy  cost 

rate  of  state  i.  The  failure  process  is  a  CTMC  with  transitions  A 

ij 

1  <  i,  and  repair  rate  =  A  -v  i.  If  the  chosen  action  in  state 

i  is  to  repair,  then  the  transition  from  state  i  to  state  M  is  added 

while  if  the  chosen  action  is  inaction,  the  transition  from  i  to  M 

does  not  exist.  In  this  section,  it  is  shown  that  the  optimal  policy  is 

a  control  limit  rule  with  return  state  M. 

Let  S  =  {0,1, ...,M},  let  A  =  {0  =  do  nothing,  l  =  repair},  and  let 

y  -  sup  X  (a).  The  rewards  and  transitions  for  the  equivalent  DTMDP  are 
i,a  1 

calculated  according  to  Theorem  7.6. 


"ij<0) 


V° 


V* 

for 

j 

* 

i 

‘  -  v/x 

for 

j 

a 

i 

VT 

for 

j 

* 

i.M 

*/y 

for 

j 

a 

M 

1  -  (X,  +X)/Y 

i  • 

for 

j 

- 

i 
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Theorem  8.2;  If  conditions  (L),  (2),  and  (3)  hold,  then  a  control  limit 


rule  with  return  state  M  is  optimal  in  both  the  discounted  and  average 
cost  cases. 

Proof:  In  the  discounted  case,  the  standard  dynamic  programming 
recursion  is 


<l>(i,a,N+l) 


min 


k&)  +  a  Jo  pij(0H(j*°’N); 


(  P+y) 


A. 

1 


(>Y)' 


M 

+  0  I  P,  •  ( D'Kj  ,a,N) }  for  i  >  0 
j=0  1J 


(C+K+Aq) 

(FrT 


+  a 


M 


I 

j=0 


P0j(O<Kj,a,N) 


for  i  =  0 


Since  A^  is  nonincreasing  in  i,  <Ki,a,0)  is  nonincreasing  in  i. 

Assume  inductively  that  $(i,a,N)  is  nonincreasing  in  i.  Then  by  the 

hypotheses  and  Lemma  8.1,  <l»(i,a,N+l)  is  nonincreasing  in  i,  and 

(|<(i,a)  =»  lim  4i(i,a,N)  is  nonincreasing  in  i.  Theorem  7.1  then  applies 
N-w 

to  show  that  state  M  is  the  optimal  return  state.  The  recursion  for 
<Ki,a)  for  i  >  0  is 


i-i  1- A  # 

+  7  l  ^Wj.a)  +  a  - ~  <K i , a) 

Y  j  =  0  13  Y 


c 

P+Y 


+ 


A  i-1  1-A 

+vj0  xii*(J'a)  +  -  — r~  *(1-»> 


+  O  ~  4KM,a) } 
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A  i-1 

-  TFr)  +  r  j^0  Yj*(J’a)  + 

+  min  {O;  +  a  ~  (KM,  a)  -  a  ^  (p(i,a) }  . 

Since  4>(i,a)  is  nonincreasing  in  i,  if  the  minimum  is  obtained  by 

repair  in  state  i*  >  0,  then  it  will  also  be  obtained  by  repair  in  all 

states  k  i*.  Thus,  a  control  limit  rule  is  optimal  in  the  discounted 

case.  The  result  follows  for  the  average  cost  case  from 

4>(i)  -  lira  ( l-a)4>(i ,  a) .  0 

<r+L 

8.2.  Theorem  on  Optimal  Repair  Policies 

This  section  differs  frcm  the  preceding  sections  in  that  no  cost 
structure  is  assumed.  System  operation  is  modeled  by  a  positive 
recurrent  CTMC  with  state  space  {0,l,...,M}.  The  system  operates 
during  repair,  and  the  decision  of  interest  is  the  designation  of  a 
target  state  for  the  repair  process.  There  is  a  choice  of  repairing 
from  state  J  to  state  J  +  H  or  from  J  to  J  +  I  with  I  H. 

This  may  be  thought  of  as  repairing  a  system  all  at  once  (J-U+H)  or  in 
stages  (J-M+I+J+H) .  It  is  necessary  to  assume  \j,J+H 

which  seems  counterintuitive  since  a  repair  process  which  returns  the 
system  to  a  higher  state  would  be  expected  to  take  longer.  Unfor¬ 
tunately,  an  assumption  such  as  +  l/\j+I,J+H  2.  l/\j,J+H, 

which  says  that  mean  repair  time  is  shorter  when  repairing  in  stages 
than  when  repairing  all  at  once,  is  not  sufficient.  Examples  are  given 
In  section  8.3  to  illustrate  the  problem. 
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Throughout  this  section,  system  (i)  will  refer  to  repair  from  J  to 
J+I  while  system  (2)  will  refer  to  repair  from  J  to  J+H. 

Superscripts  (1)  and  (2)  will  be  used  in  this  capacity,  e.g. , 

(1)  (1)  _  x(2)  _  x(2) 

XJ,J+I  ?  °’  J , J+H  XJ,J+I  °’  j , J+H  >  °* 

Lemmas  8.3,  8.9,  and  8.11  are  results  from  CTMC  theory.  Their  proofs 
are  contained  in  the  Appendix  since  they  are  lengthy  and  do  not  add  to 
the  results  of  this  section. 


Lemma  8.3:  Assume  X  .  <  X„,  and  X.,  <  -v  j  <  A  <  i.  Let  set 

-  ij  -  Aj  ji  -  H 

A  =  {k,k+l ,  . . .  ,M}.  Then  p. .  >  p..  **  j  <  i  <  k  where  equality 

J  A  — ’  lA 

holds  if  and  only  if  (i)  v  A  *  0,  j-1  and 

A  ®  i+l ,  k-l,  and  (ii)  X.  .  =  X..  •  If  (i)  and  (ii)  hold, 

j  A  lA 


then  pjA  =  pj+l,A 


=  p^.  Also,  if  set  B  =  {0,1, ...,k}. 


then  p^B  _<  p^  -v  k  <  j  <  i  where  equality  holds  if  and  only  if 

(i'  )  X  *  A  *  k+1 ,  . . .  ,  j-1  and  A  =  i+l ,  . .  . ,  M,  and 

(ii')  X.D  =  XJD.  If  (i ' )  and  (ii' )  hold,  then 
JB  IB 

jB  ^j+l,B  ^iB 

Notation:  p/y$)c  *  E(time  from  arrival  in  state  or  set  of  states  A 
until  the  next  arrival  in  state  or  fit  of  states  B,  given  that 
condition  C  holds). 


Theorem  8.4 :  If  X^  <  X^  and  X^  <  X^  -v  j  <  A  <  i,  then 


I  ^  J  <  l  ^2)  U  =  0,  1 . M  . 

i-k  i-k 


(The  superscripts  (1)  and  (2)  refer  to  systems  (1)  and  (2)  as  discussed 


above) . 
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Proof :  Let  set  A  =  (k,k+l 


M},  and  consider  the  case  J  <  k. 


rM 

=  ‘•i-k  wl  “  m^A^^AA'  T*le  tW°  8ysteras  are  identical  except  for 
transitions  from  state  J.  Thus,  m^^(A)  =*  m^^(A). 


u  =  u  i/  ,,P(noJ)  +  i"  p  «  .  ,+p,  >(J) 
^AA  ^AA  (noJ)  AJ  (J)  ^JA  J 


where  (J)  =  hitting  state  J  before  returning  to  set  A,  and 
(noJ)  =  not  hitting  state  J  before  returning  to  set  A.  The  only 

difference  between  end  Is  anc* 

(2)  (1)  (2) 

.  It  will  be  shown  that  pj^  _>  pj^  which  implies 

^  -  *iAA>  Whlch  lraPlles  ni1}  -  42)* 

Conditioning  on  the  first  jump  from  state  J  yields 


u>  I  ,<„■>,<!> 


Ji  iA 


_ L  xJ.J+i _  (i)  5 _ \ji _ 

J+I’#  v^+t 


^iA|(noJ)Pi(n°J)  +  UiJ|(J)  +  ^I))Pi(J)] 


where  X  «  l  X 

J  i?<  J+I ,  J1 


Rewriting  the  equation: 


(X  +x(l)  w(1) 

(  XJ*+XJ,J+I)^JA 


1  +  XJ,J+i4+I,A  +  lJJA)  Jk  XJiPi(J) 

i^J+I 


+  Jk  XJi^iA|(noJ)Pi(n°J)  +  4iJ|(J)Pi(J)] 
i^J+I 
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(X  +\(,l)T  =  1  +  X*1*  .  +  V  X  Tu  ,,  P .  (noJ) 

J*  J,J+I  X  JA  J,J+I  J+I,A  JiL  lA|(noJ)  1 


i^J+I 


(8.1) 


where 


l  XJiPi(J) 

i<k  J 

i^J+I,J+H 


(2)  Performing  a  similar  calculation  for  Pj  '  yields 


(\j*+XJ,J+H  CX^JA  1  +  XJ , J+H^J+H , A  +  j  XJi^iA|(noJ)Pi(n°J) 


i<k 
i^  J+H 


+  ^ij|(J)Pi(J) ^ 


(8.2) 


Subtracting  Equation  (8.2)  from  (8.1): 


+\(1)  -c  w(1)  -  x(1)  u(1)  =  ix  +x(2)  -c  w(2) 

(XJ*J,J+I  C X  ^JA  \j,J+I^J+I,A  (XJ*  +  \j,J+H  C X  ^JA 


X(2)  u(2) 

J , J+H^J+H , A 


(8.3) 


Now  assune  <  •£’ •  This  will  lead  to  a  contradiction. 


4?  <  “jf  a"d  V  -  cx  ■  l  V‘-VJ»  >  o 

i<k 

i^ J+I , J+H 


(since  otherwise  the  process  never  returns  to  A),  Equation  (8.3) 


becomes 
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\(l)  u(1)  -  X(1)  u(l)  >  X(2)  u(2)  -  X(2)  u(2) 

J,  J+I  \JA  J.J+I^J+I.A  J.J+ll  ^JA  J.J+H^J+H.A 


(2)  (2) 

From  Lemma  8.3,  HJ+H>A  <  HJ+I>A.  so: 


\lL)  IV1'  -  uU;  1  >  -  uUJ  1 

J ,  J+I  ^JA  ^J+I.A'  >  J ,  J+H  ^JA  ^J+I,A'!  * 


(8.4) 


From  Lemma  8.3,  both  quantities  in  parentheses  in  Equation  (8.4)  are 
nonnegative.  Conditioning  on  the  first  jump  from  state  J+I  in  either 
system  yields: 


I , A  pJ-t-I,A!(noJ)l>J+JnoJ^  +  ^J+I.JlU)  +  y  JA  ^PJ+I<J^ 


‘  °x  +  “«'2>  FJ+l(J> 


where 


°X  =  ^J+I,Al(noJ)PJ+I(n°J)  +  ^J+I , J | (J)PJ+I(J) 


(1  2} 

Using  ^  in  Equation  (8.4)  yields: 


ij!}+i!^"(‘-pJ+i‘J>K]  >  xS‘i™ ICl‘-5+I  <J)K5 


(2)  r..(2) 


However,  this  is  impossible  since  X,^  <  X^2^ , 

—  J  ,  J+ri 

the  quantities  in  the  brackets  are  nonnegative,  and  HjA^  <  ^jA^ 
was  assumed.  This  is  the  desired  contradiction. 

Now  consider  the  case  for  which  k  <  J.  Let  B  =  {0  ,  l , . . . ,k-i ) . 

<  it*2^  is  equivalent  to  >  1q2^.  As  before, 

=  m(B)/^B,  and  the  only  difference  between  systems  (1)  and  (2)  is 

^J^  and  ^JB 
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Now  assiane  ^  >  p^)#  This  will  lead  to  a  contradiction. 

J  D  JB 

With  and  -  C^  >  0,  Equation  (8.7)  becomes 


(1)  (1)  _  (1)  (1)  (2)  (2)  _  (2)  (2) 

J, J+I  oB  J, J+I^J+I , B  J, J+H*JB  J,J+H^J+H,B 


(2)  (2) 

From  Lemma  8.3,  PJ+'R  >  so 


.(!)  ,  (1)  ,(1)  .  „  . (2 )  .  (2)  (2) 

XJ,J+I^JB  ^J+I ,  B  <  XJ,J+H^JB  ~  ^J+I ,  B  °r 


X(1)  ru(1)  uU)>  >  X(2)  fu(2)  -  u(2)1  (H 

XJ , J+L  ^J+I , B  MJB  5  XJ , J+H  ^J+ 1 , B  ^JB  }  (8,8) 


where  both  quantities  in  parentheses  in  Equation  (8.8)  are  nonnegative 
by  Lemma  8.3.  Conditioning  on  the  first  jump  from  state  J+I  in  either 
system: 


(1.2) 


(1,2), 


UJ+I,B  ^J+Il (noJ)PJ+I(n°J)  +  (^J+I,J|(J)  +  ^JB  )PJ+I(J) 


\+  4i'2)wj> 


1 


where : 


D\  ^J+I.Ji (noJ)PJ+I(n°J)  +  ^J+I,J|(J)PJ+I(J) 


Using  in  Equation  8.8  yields: 


\j,J+I*DX  “  ^JB  (1"PJ+I(J))1  ?  XJ,J+H[D\  ^JB  (1_PJ+I(J))'  * 
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However,  this  equation  cannot  hold  since  X^  ^  <  X^^ 

J^J+l  —  J  ,  jTri 

the  quantities  in  the  brackets  are  nonnegative,  and  >  |ijjp 

was  assisted.  This  contradiction  means 

<  rS>  -  &><  -*  41’  >  ^2)-  a 

The  following  corollaries  help  relate  Theorem  8.4  to  the  optimal 
operation  of  a  system. 

Corollary  8.5:  If  there  is  a  nondecreasing  utility  function 
corresponding  to  the  states  of  the  CTMC ,  then  under  the  conditions  of 
Theorem  8.4,  system  (2)  has  higher  expected  long-run  utility. 


Proof:  Let  the  utility  of  state  j 


be  a. 
J 


with  a.  >  a .  , . 
J  “  3-1 


M 

I 

i=0 

where 


„  _(1> 

Vi 


M 

l 

i=0 


M 

‘ill. 

J=i 


.(1) 


M  M 

)  <  I  b  [I 

i=0  j=i 


.(2) 


M 

I 

i=0 


a  (2) 
a.  x. 

l  l 


b,  =  a.  -  a,  ,  for  i>l  and  b  =  a„  .  D 
i  i  i-1  —  o  0 


Corollary  8.6:  If  there  is  a  nondecreasing  utility  function 
corresponding  to  the  states  of  the  CTMC,  if  the  hypotheses  of  Theorem 
8.4  hold,  and  if  X^^  =  X^  *  i  >  J,  i  >  J,  then  it  is  optimal  to 
repair  to  state  M. 

Proof:  In  System  (2)  set  H  =  M-J,  and  in  System  (1)  set 
I  «  0,  1,  ...,  M-J-l.  From  Theorem  8.4, 


M 


l 

i=*k 


M 


<  l 

i-k 


*  k. 
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and  from  Corollary  8.5,  this  is  optimal.  □ 


Corollary  8.7;  Under  the  conditions  of  Theorem  8.4,  increasing  a  repair 


rM 

rate  increases  ^  v  k. 


Proof:  This  is  the  case  I  =  H  in  Theorem  8.4.  □ 


It  can  also  be  shown  that  decreasing  a  failure  rate  increases 


VM 

H=k  V 


Corollary  8.8:  If  Theorem  8.4  describes  the  steady-state  operation  of 
one  component  in  a  coherent  MSF,  System  (2)  has  higher  expected  utility 
then  System  (1 ) . 


Proof_:  All  min  paths  at  all  levels  contain  the  component  at  a  certain 

level  or  higher.  Since 


”  «;i)  <  i  .«>  vk. 

i=k  i=k 


h  (P)  is  larger  in  System  (2)  *  k.  D 


■  hypotheses  of  Theorem  8.4  can  be  changed  to  other  hypotheses  which 
were  used  in  the  previous  section. 


n 

Lemma  8.9:  Assume  ).  .  is  nonincreasing  in  i  -v  i  <  i  and 

-  “j*0  ij 

rM  , 

A*-o  *-s  nondecreasing  in  i  v  i  >  i.  Let  A  =  {k ,k+L , . . . ,M } , 

j  x  ij 

Then  |i  i>  \i ^  *  j  <  i  <  k  where  equality  holds  if  and  only  if 


(i)  Xj*  =  Xj+i,A 


=  X^  v  l  m  o,  1,  . . . ,  j-1  and 


1  =  i+1,  ...,  k-1,  and  (ii)  X  »  X,  ■  •••  =*  X.  .  If  (i)  and 

j  A  j+ 1 ,  A  1A 
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(ii)  hold,  p.A  =  •••  =  p..  •  Also,  if  B  =  {0,1, ...,k),  then 
J  A  1A 

p  <  p.D  k  <  j  <  i  where  equality  holds  if  and  only  if 
j  B  —  iB 

(i,)  V =  Vi .1 =  *** =  \i  *  * =  k+1 . j  and  1  ‘ 1+1  • 


....  M,  and  (ii*)  XjB  =  Xj+1>B 
hold,  njB  -  •••«  P^- 


iB  ' 


If  (i  * )  and  (ii') 


Theorem  8.10:  If  Ij=Q  nonincreasin£  in  1  *  A  <  i  and 


M 

£  ^  Ay  is  nondecreasing  in  i  -v  A  >  i,  then 

I*?  l.  1  ^  <  it*  u  -v  k  =  0,  1,  ...,  M. 

£ji=k  i  —  ^i=k  i 


Proof:  The  proof  is  identical  to  Theorem  8.4  with  Lemma  8.9  replacing 
Lemma  8.3.  □ 


In  Theorem  8.4  several  types  of  restoration  may  occur  simultaneously 
from  a  single  state.  In  fact  it  was  not  necessary  to  have 
Aj^  =  0  and  A^)  -  0.  The  weaker  hypotheses 


xO)  ,  x(l)  ,  .(2)  (2) 

J.J+I  J,J+H  -  J,J+I  J,J+H 


and 


A(1) 

J,J+H 


<  A 


(2) 

J,  J+H 


would  have  sufficed,  but  this  lengthens  the  proof.  In  Theorem  8.12  only 
one  restoration  activity  from  each  state  is  permitted.  The  hypotheses 
are  slightly  different  from  Theorem  8.4,  but  the  result  is  the  same. 


Lemma  8.11:  Assume 
i  >  j  for  each  j . 

A  »  {k, . . . ,M}.  Then 


and  only  if  (i)  A 


and  (ii)  A 


j  A 


iA 


1A* 
Also, 


A. .  <  A0,  v  iU  <  i  and  A..  >  0  for  a  single 
ij  -  *j  Ji 

Assume  A..  <  A  and  i  <  m  when  j  <  A.  Let 
j  i  —  Am  — 

p^  P^A  *  j  <  i  <  k  where  equality  holds  if 

“  A^  v  A  =  0,l,...,j-l  and  A  =  i+1 ,  . . . ,  k-1. 

If  (i)  and  (ii)  hold,  then  p  -  p 

jA  j+l,A 

if  B  =  {0,1,..., k),  then  ptg  1  v  k  <  j  <  1 
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where  equality  holds  if  and  only  if  (i')  X  =*  X^  v  i  -  k+l,  ...,  j-1 

and  A  -  i+1,..  ,M ,  and  (ii’)  X.  -  Xjn  .  If  (i')  and  (i i’ )  hold, 

Jo  lo 

then  njB  -  ...  = 


Theorem  8.12:  Assume  X..  <  X,,  -v  j  <  i  <  i  and  X  >0  for  a 
-  ij  —  xj  ji 

single  i  >  j  for  each  j.  Assume  X  <  X.  and  i  <  m  when  j  <  A. 

j  i  —  im  — 


Then  [?,k  „«>  <  £.k  „«>  ,  k. 


Proof :  Let  A  *  {k,k+l , . . . ,M}  and  consider  the  case  J  <  k.  As  in 
Theorem  8.4,  the  only  difference  between  Systems  (1)  and  (2)  is  p 


It  will  be  shown  that  p^^  >  which  Implies 


i(1> 

AA 


,(2  ) 


.O) 


—  ,JAA  whlch  implies  <  .. 


(2) 


the  first  jump  from  state  J  yields: 

•(1)  -  m(l)f  n  +  V  n(l)  „<l> 
m  U)  +  i  PJi  PiA 


(1) 


JA 


i<k 


Conditioning  on 


JA 


X  +X(1) 


X(I) 

1  ,  J,J+I 

i“\  » 


X  +x(1) 


,<l> 

J+I.A 


+  1  r3r~  UlA!(noJ)Pi(noJ)  +  (piJ|(J)+^))Pi^J>1 


i<J  \j*+\j,j+l 


where  X^  =  ar>d  (J)  -  hitting  state  J  before  returning 

to  set  A  (as  in  Theorem  8.4). 


(XJ*  +  XJ,J+I  CX)|iJA 


1  +  X(1)  u(l) 

1  J  ,  J+I  PJ+l ,  A 


+  i<J  XjlllilA|(noJ)Pi(noJ)  +  Pij|(j)Pi(J)  1 


(8.9) 


where  C, 


*i<J  \jiPi(J)* 
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(2)  A  similar  calculation  for  yields: 

,x  +  X(2)  -  C  W(2)  =  1  +  \(2)  U(2) 

(KJ*  +  j ,  J+H  CX^JA  1  +  \j,J+HUJ+H,A 


+  ^  XJi^iA|  (noJ)Pi^n°J^  +  ^ 


1<J 


U  (J)‘ix 


(8.10) 


Subtracting  Equation  (8.9)  from  (8.10): 


(X  +  x(1)  -  C.)^-  X(l)  ^  =  (X  +  x(2)  -  cx)^ 

J*  J,J+I  X  JA  J,J+I^J+I,A  J*  J,J+H  X  JA 


X(2)  U(2> 

J.J+H^J+H.A  ' 


(8.11) 


Now  assume  <  p!:2\  This  will  lead  to  a  contradiction. 

J  A  J  A 

Since  Xj*  -  >  0,  Equation  (8.11)  becomes 


(1)  (1)  _  (1)  (1)  (2)  (2)  _  (2)  (2) 

J , J+I^JA  J , J+I^J+I , A  J, J+H ^JA  J , J+H ^ J+H , A 


From  Lemma  8.11,  so 


,(l)  r  ,(1)  „<l)  x  s  *<2)  (2)  (2) 

\l,J+I'  ^JA  ^J+H , A'  >  J, J+H' ^JA  1J+H,a' 


(8.12) 


where  the  quantities  in  parentheses  in  Equation  (8.12)  are  nonnegative 
by  Lemma  8.11.  Conditioning  on  the  first  jump  from  state  J+H: 


*\j+H,A  =  ^J+H.Al  (noJ)PJ+H(n°J)  +  1^J+H,J|(J)+  ^JA  1  PJ-W(J) 

. (1,2) 


JA 


PJ+H<J>  +  DX 


where 


Dx  =  ^J+H,a!  (noJ)PJ+H(n0j)  +  tiJ+H,j|(J)PJ+H(J) 
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Using  *n  E<luation  (8.12): 

XJ,J+I^JA  ~  D\^  >  \j,J+H^JA  (1_PJ+H(J)) 


Dx> 


However,  £.  Xj^j+h*  the  qua®tities  In  the  brackets 

are  nonnegative,  and  <  HjA^  was  a88IJ,ne<i»  80  this  is  the 

desired  contr  idiction,  and  ^  . 

JA  —  JA 

Now  let  B  »  {0, 1 , . . . ,k-l }  and  consider  the  case  k  <  J.  As  in 
Theorem  8.4,  the  only  difference  between  and  is 


,(l) 

JB 


and  H 


(2) 


JB  ‘ 


(0  **  ’  m(1>(J>  +  ilk  ** 


\  ( 1 ) 

J _  .  J . J-n  u(l)  +  l 

k(l)  x  4-  x(1>  J+1’B  k<i' 

*  T  T  l  T  ^  T*  '  ^  T  T  »  T 


Mi 


\j*+XJ,J+I  'J* '  '\J,J+I 


k<i<J  (1) 

J*  \j,J+I 


X  luiB|(noJ)Pi(noJ)  +  (^iJ|(J)+,1JB>)Pi(J) 


«here  XJ#  =■  1±<J  X^. 


(\  +X(l)  -  C  lu(1)-  1  +  x(1)  n(1) 

{\j*+XJ,J+I  CX^JB  J,  J+I^J+I ,  B 


+  k<\a  XJi^iB|(noJ)Pi(n0j)  +  ^iJ|(J)Pi(J))  (8< 


where  * 


13) 


Ik<i<J  xjiPi(J> 


(2)  A  similar  calculation  for  System  (2)  yields: 


fx  .  x<2)  _  c  .(2),  x(2)  (2) 

*XJ*  +  J ,  J+H  C\  ^JB  1  j , J+H ^J+H , B 

+  k<i<J  XJi["lB!(noJ)Pi(n0j)  +  ‘iiJ|(J)Pl(J)1 


Subtracting  Equation  (8.14)  from  (8.13): 


,x  4.  x(1)  c  w(1)-  x(1)  U(1)  .  (X  +  x(2)  -  C  )U(2) 

(XJ*  +  XJ,J+I  CX  ^JB  J, J+I^J+I ,  B  (\j*  XJ,  J+H 

(2)  (2) 

J , J+H^J+H,B  ‘ 

Now  assume  Hjg'*  >  This  will  lead  to  a  contradiction. 

Since  X  -  C  >  0,  Equation  (8.15)  becomes: 

J*  \ 


(8.14) 


(8.15) 


X(l)  u(1)  -  X(1)  u(1)  <  X(2)  u(2) 

J,J+I^JB  J , J+I rj+I , B  s  J, J+H^JB 


(2)  (2) 

From  Lemma  8.11,  g  _<  Hj+h  g,  so 


x(2)  ,(2) 

J, J+H^J+H, B 


x(l)  i.(l)  _  .XI),  v  x<2)  ,  (2)  _  (2),  fo  l6v 

XJ , J+I  ^J+I , B  ^JB  ]  >  XJ, J+H  ^J+I , B  ^JB  1  *  (8.16) 


where  both  quantities  in  brackets  in  Equation  (8.16)  are  nonnegative  by 
Lemma  8.11.  Conditioning  on  the  first  jump  from  state  J+I: 


4+1, B  ”  pJ+I,B|(noJ)PJ+I(noJ)  +  ( ‘‘j+I.Jl  ( J)+  UJB  ’  ,PJ+I(J) 


^,2>pJ+i(J)  + 


where 


DX  "  ^J+I,B|(noJ)PJ+I(n°J)  +  UJ+I,J|(J)PJ+I(J) 


-168- 


Using  J*  in  Equation  (8.16)  yields: 
J  D 


\j,J+JD\~MJB  (1_PJ+I(J))j  >  XJ,J+H^D\-tlJB  (1_PJ+I(J)^ 


However,  this  is  a  contradiction  since  S.  ^j^j+h’ 

quantities  in  brackets  are  nonnegative,  and  >  pjjp  was 

assumed.  Thus,  .  0 


Theorem  8.13  differs  from  the  preceding  theorems  in  that  any  repair 
from  a  state  smaller  than  state  J+H  must  reach  J+H  before  proceding 
to  any  state  larger  than  J+H.  This  provides  a  slightly  different 
result  without  assumptions  on  the  failure  rates. 


Theorem  8.13:  Let  X^  **  0  whenever  i  <  J+H,  j  >  J+H  or  i  >  J+H, 
j  <  J+H.  Then  }  <  v  k  >  J+H. 

Proof :  Conditioning  on  the  first  jump  from  state  k  J+H : 


\k  =  \k|(noJ+H)  Pk<noJ+H>  + 


[^k,J+H| (J+H)  + 


VH.k1  Pk(J+H) 


(1,2) 

The  only  difference  between  Systems  (1)  and  (2)  is  If 

will  be  shown  that  'jllu  ,  >  i  which  implies 

J+H  ,  K  —  J+H,K 

-  |lkk^  which  lmPlies  <  1tk2^- 


^J+H,k 


^J+H,k|  (noJ)PJ+H(n°J)  +  1  ^J+H,J|(J)  +  ^J,J+U+|iJ+H,klPJ+H(J) 
ttlJ+H,k|(noJ)PJ+H(n0j)  +  (^J+H,J|(J)+^J,J+H)PJ+H(J)  1 

/ll'PJ+H(J)1  *  (8,17) 
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The  only  difference  between  k  and  k  is 

4%  -  £l»- 


Conditioning  on  the  first  jump  from  state  J: 


/.x  CD  (l),Tx  x  V  p^1)  ..(U 

U)  t+h  =  m  (J)  +  l  PJi  ^i  J 

J,J+H  H  Ji  l.J 


J+H 


xO)  X 

1  *J,J+I  (1)  .  y  Ji 

ITT)  X(D  j+i,j+h  t<j+H  \j. 


(i) 


*  ^i,J+H|(noJ)Pi(n°J)  +  (^iJ|(J)  +  ^J,J+H)Pi(J)1 


(1)  -  n  X  x(1)  ,L(1)  +  LWfX(1)-  c  ) 

^J,J+H  "  (1  \j,J+I^J+I,J+H  L)/(XJ*  CV 


(8.18) 


where 


CX  =  l  XJiPi(J) 
X  i<J+H  1 

i^J+I 


L  ‘  „L  XJi(l*l,XH|(noJ)Pi<noJ)  * 
i\  J+n 

i/J+I 


(2) 


(2)  A  similar  calculation  for  jij  j+^  yields 


&  •  “+l>'<1j2.)  -  cx> 


(8.19) 
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Since 


\(1)  + 


l 

tfJ+I, 

J+H 


XJi  < 


X(2)  + 

J , J+H  + 


i*  J+I , 

J+H 


Ji 


and 


,(1)  (1) 

J,J+I*\j+I,J+H  - 


comparing  Equations  (8.18)  and  (8.19)  shows  that 
>  /f2Lu.  Then,  from  Equation  (8.17), 

^J+i,k  >  ^J+H , k  aS  re9uired.  0 

8.3.  Examples 

The  first  two  examples  in  this  section  are  complementary.  Example 
8.1  contains  a  system  with  mean  repair  time  that  is  shorter  when 
repairing  all  at  once  than  when  repairing  in  stages.  However,  the 
optimal  repair  strategy  is  repairing  in  stages.  This  means  that  the 
assumption  X.,  ^  „  cannot  be  replaced  by 

^  XJ , J+H  -  ^^J.J+I  +  l^XJ+I,J+H*  In  ExamPle  8,2  lt: 

is  optimal  to  repair  all  at  once  even  though  mean  repair  time  when 

repairing  in  stages  is  shorter  than  when  repairing  all  at  once. 


Example  8.1:  Consider  the  4-state  CTMC  shown  in  Figure  8.1. 


Figure  8.1.  CTMC 
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r 


In  state  t  there  is  a  choice  between  repairing  all  at  once:  >  0, 


^12  ”  or  rePairl-ri8  stages:  =  0,  >  0.  Let  X^  =  X^  ^  =  X 

■  So  ■  x0l  ■  *•  X,2  ■  5’  XI3  ‘  x23  ‘  2-  To  flnd  the 
steady-state  probabilities,  the  following  equations  are  solved. 


(1) 

X1I3  =  X13xt  +  X23 

n2 

(2) 

(X+X23)it2  =  X^rtj 

+ 

(3) 

(X10+X12+X13)ni  = 

X01*0  + 

(4) 

Koiuo  =  \oui 

(5) 

*o  +  *1  +  *2  +  n3 

-  I 

Case  1:  repair  in  stages,  X^  =  0 


X  -  XSo  +  X\,  -  XX01X12  +  X01X12X23  ‘  ‘°-52 
*0  -  x2xlQ/x  53  -ooio 

*1  “  x2x01/x  =  .0010 

*2“  U01X12A=  -°474 
*3"  X01X12X23A=  -9506 


Case  2:  repair  all  at  once,  X  =  0 
X  =  X  X1Q  +  x  XQi  +  2XX01X13  +  Xq  1^1 3^2 3  =  4,42 


*0  "  x\o/x 
*i  "  x2\h/x 
*2  m  XX01X13/X 


=  .0023 
=  .0023 
-  .0452 


S*  (XXOLXL3+XOIX13X23,/X-  -9502 


7  .  n.  is  larger  in  case  1  -v  k  so  it  is  optimal  to  repair  in 

t‘i“«t  I 

stages.  This  is  true  even  though  expected  repair  time  when  repairing 
all  at  once,  1  /  X  ^  ^  =*  .5,  is  shorter  than  expected  repair  time  when 


-172- 


repairing  in  stages,  l/X^  +  ^^23  ”  ’  T'le  reason  for  this  is  that 

1/ ^1 3  “  *5  >  l/X^  ”  *25,  so  the  system  exits  from  state  1  more 
quickly  when  repairing  in  stages.  This  means  that  the  transition  from 
state  1  to  state  0  will  occur  less  frequently  in  case  1.  The 
example  may  seem  somewhat  contrived  since  X  ^  =  lOX^^.  However,  it  is 
still  optimal  to  repair  in  stages  with  X  *=  X  “  ^2  =  *  when  a 
linear  utility  function  is  used  as  shown  in  Table  8.1.  Repairing  in 
stages  is  not  uniformly  superior  for  all  nondecreasing  utility  functions 
in  this  case  because  1  ^  ^ 


X 

*0 

ni 

*2 

*3 

u  =  io  i7ti 

Case 

1 

17 

.059 

.059 

m 

.588 

2.41 

Case 

2 

10 

.1 

.1 

■ 

.6 

2.3 

Table  8.1.  Repair  Comparison 

Example  8.2:  Consider  the  5-state  CTMC  shown  in  Figure  8.2. 


In  state  l  there  is  a  choice  between  repairing  to  state 
4.  let  X ^  »  .1,  X^  “  1.  X^^  «*  .2,  X ^  “  .1,  X^  “  .25, 


3  or 

in¬ 


state 

1. 
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14 


.1,  Xl3  .25,  XlQ 


1,  * 


01 


1.  The  following  equations  need  to 


be  solved  to  determine  the  steady -state  probabilities. 


^  X43n4  “  X‘141ll  +  X24*2  +  X34n3 

(2)  ( W*3  =  X13ni  +  X23*2  + 

(3)  CX21+X23-h\24)^.2  =  \32n3 


(4)  (Xl0+Ki3+Xi4)7ti  =  X0ino  +  X2in2 

(5)  Nn’b  =  \o*i 

(6)  uQ  +  U1  +  n2  +  *3  +  \  =  1 


Case  1:  repair  in  stages,  \  ^  =  0 


\  -  XioX2lX32X43  +  X01X21X32X43  +  X01X13X32X43 

+  X0IX1  3X43(X21+X23+X24)  +  X0 1 N  3(  X24X32+X2  1  X34+X23  VW 
n0  =  X10X21X32X43/X 
*1  =  X01X2lX32X4  3/X 
^2  =  X01X1  3X32X43/X 
n3  =  X01X13X4  3(VX23+X24)/X 
n4  =  X01X13^  X24X32+X21X34+X23X34+X24X34)/X 


Case  2:  repair  all  at  once,  X^3  =  0 

X  -  K.K..K.K.+  V21S2X0  +  \>lW  V*23 +  W 

+  \  /X  ,  V,,  +  X24X32  +  X2 1 X34  +  X23X34  +  X24^34^ 


10  21  32  43 

^01  X1 4(  '2  1  '32 
*0  "  X10X21X32X4  3/X 


1 


X01X2  1X32X43/X 


*2  “  X01X14X32X4  3/X 

*3  ”  X01X14X43(  X21+X23+X24)/X 

*4  “  X01  X14(  X21X32+X24X32+X21X34+X23X34+X24X34)/X 


The  results  of  the  calculations  are  shown  in  Table  8.2. 
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X 

*0 

*1 

*2 

*3 

\ 

Case  1 

.2612 

.038 

.383 

.096 

.129 

.354 

Case  2 

.2705 

.037 

.370 

.037 

.050 

.506 

Table  8.2.  Steady-state  Probabilities 

From  Table  8.2  it^  is  larger  in  Case  2  -v  k,  so  repairing  all  at 

once  is  optimal  even  though  expected  repair  time  when  repairing  all  at 
once,  1/X  =  10,  is  larger  than  expected  repair  time  when  repairing  in 

stages,  l/X^  +  1/X^  ”  9.  The  reason  for  this  is  that  the  transition 
from  state  3  to  state  2  may  occur  when  repairing  in  stages.  It  can 
be  shown  that  -  11  <  -  22.6.  □ 

The  next  example  is  a  system  for  which  it  is  optimal  to  repair  from 
state  J  to  state  J+H  and  from  state  J+l  to  state  J+I  even  though 
I  <  H. 

Example  8.3:  Consider  the  4-state  CTMC  shown  in  Figure  8.3. 


I 
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In  state  1  there  is  a  choice  between  repairing  all  at  once:  \  >  0, 


X,,  *  0,  or  repairing  in  stages:  XJ3  “  0,  XJ2  >  0.  Let  X„„  =  X. 


12 


32 


21 


=  X 


.1,  X, 


1.  K 


l,  K 


2,  X  *  4,  and  X^  =  2.  The  following 


10  ’  03  ’  23 

equations  are  solved  to  determine  the  steady-state  probabilities 


(1) 

Xlt3  =  X03n0  +  X13 

*1  + 

(2) 

(  \+  X2  ~  ^  ^  ^  ^  i 

+  Xu, 

(3) 

(X10+X12+X13)ni  = 

Xn2 

(4) 

X03n0  =  X10K1 

(5) 

it0  +  n1  +  u2  +  ir3 

=  1 

23  2 


Case  1:  repair  in  stages,  X^  =  0 

\  -  >>2\0  +  *2\,3  +  2^03\0  +  XX03X13  +  X03X10X23  +  X03X12X23 

«o  ’  x2xio/x 

*,  •  x2wx 

*2  ‘  (X\>3X,0  +  XX03X,3,/X 

*3  =  (XX03X10  +  X03X10X23  +  \)3X12X23)/X 


Case  2:  repair  all  at  once,  X 


12 


X  -  X2X1q  +  X2X03  +  2XXq3X10  +  2XXQ3X13  +  X^^  +  X^^ 

*o  ■  XVX 

*1  ■  x2wx 

*2  ^XX03X10  +  XX03X13^X 

*3  "  (XX03X10+XX03X13+X03X10X23+X03X13X23)/X 


The  results  of  the  calculations  are  shown  in  Table  8.3. 
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X 

*0 

*1 

*2 

*3 

Case  1 

10.62 

.001 

.001 

.047 

.951 

Case  2 

6.62 

.0015 

.0015 

.0755 

.9215 

Table  8.3.  Steady-state  Probabilities 
r3 

From  Table  8.3,  ^  is  larger  in  Case  1  -v  k,  so  it  is  optimal 

to  repair  in  stages.  This  is  true  even  though  expected  repair  time  when 
repairing  all  at  once,  1/X^  =  *5,  is  less  than  expected  repair  time 
when  repairing  in  stages,  1/X^  +  ^^23  *  reason  f°r  i^is  is 

that  l/A.^  “  *5  >  l/X^  “  *25,  so  the  system  exists  from  state  1  more 
quickly  when  repairing  in  stages,  and  the  transition  from  state  1  to 
state  0  will  not  occur  as  frequently.  Note,  however,  this  example 
does  not  satisfy  the  hypotheses  of  Theorem  8.4  since  >  X^  when 

repairing  in  stages.  □ 

The  result  of  Example  8.3  is  that  it  is  optimal  to  repair  from  state 
0  to  state  3  and  from  state  1  to  state  2,  even  though  it  is 
possible  to  repair  from  state  1  to  state  3.  A  simple  replacement 
model  is  considered  in  Example  8.4,  and  it  is  shown  that  an  increase  in 
a  failure  rate  could  actually  increase  expected  system  utility. 

Example  8,4:  Consider  the  CTMC  shown  in  Figure  8.4. 
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The  system  modeled  by  the  CTMC  shown  In  Figure  8.4  is  a  system  which  is 

run  until  complete  failure  and  then  replaced.  By  a  standard  renewal 

M 

argueraent  with  \  =  |i,  *  ( 1  / X^) / ^i_o(  l/\>  *  **  \  Increases, 

M 

decreases,  and  increases  V  k  ^  i.  Consider  £  ^ 

*  1  -  1  it. .  If  X..  increases,  then  ,  rt.  increases  for 

Lj=0  j  i  ‘‘J-k  j 

k  >  i  but  decreases  for  k  <  i  (since  Ij=q  11  j  increases). 

Assume  the  usual  nondecreasing  utility  function.  If  X^  increases, 
then  £  k  m  decreases  vk  >  1,  and  the  expected  utility  of  the 
system  decreases.  If  *  p  increases,  then  ^  increases 

v  k  >  1,  and  the  expected  system  utility  increases.  However,  a  change 


in  any  other  X^  may  increase  or  decrease  expected  system  utility 
depending  upon  the  utility  function  and  the  other  transition  rates.  0 
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9.  REPAIR  VS.  REPLACEMENT 


A  discrete  time  model  is  considered  in  this  chapter.  In  each  state 
the  operator  has  a  choice  to  repair  the  raultistate  component,  replace 
the  component,  or  do  nothing.  It  is  shown  that  the  optimal  policy  is  to 
either  repair  or  do  nothing  until  the  component  reaches  a  certain  state, 
and  to  replace  whenever  the  component  drops  below  that  state.  This  is  a 
type  of  control  limit  rule.  The  control  limit  may  be  0  in  which  case 
it  is  optimal  to  never  replace.  It  was  hoped  that  a  4-region  policy  -  a 
policy  for  which  the  optimal  actions  are  inaction,  repair,  inaction,  and 
replacement  as  the  system  state  decreases  -  would  be  optimal  as  it  is  in 
a  similar  model.  However,  an  example  is  presented  in  which  a  5-region 
policy  is  optimal,  and  it  is  conjectured  that  no  limit  on  the  number  of 
possible  regions  exists. 
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(1)  is  nonincreasing  in  i. 

(2)  nondecreasing  in  k  and  nonincreasing  in  i. 

The  second  assumption  means  that  it  costs  more  to  do  more  total  repair 

(C  .  ,  nondecreasing  in  k) ,  and  it  costs  more  to  do  the  same  amount 
i,i+k 

of  repair  starting  in  a  worse  state  (C^  nonincreasing  in  i) .  It 

is  also  expected  that  <  C  for  some  i  and  >  C  for 

some  i  so  that  there  is  a  nontrivial  choice  between  repair  and 
replacement.  If  C  v  i,  replacement  is  always  better  than 

repair.  If  C  _<  C  v  i,  repair  is  always  better  than  replacement. 

As  in  previous  chapters,  p  is  the  1-period  transition  probability 
from  state  i  to  state  j  given  that  no  restoration  takes  place  during 
the  period.  The  usual  assumptions  pertaining  to  transition  probabilities 
are  made,  namely 

r-k 


(3)  Ij=0  Py  is  nonincreasing  in  i  -v  k. 


(4)  p^  >  0  for  some  k. 

Assume  that  state  occupancy  costs  are  not  paid  during  restoration  and 
that  doing  nothing  is  not  a  possible  action  in  state  0.  Optimal  system 
operation  in  the  discounted  case  is  described  by  the  following  recur¬ 
sion. 


M 


<Ki,a,N+l)  -  min  (A  +  a  l  p..<Kj,a,N)  ;  C  +  aiJ>(M,a,N) 

j-0 


C1  i+1  +  a<Ki+l,a,N)  ;  •••  ;  +  a<KM,a,N) }  for  i>0 


min  {C  +  a<KM,a,N)  ;  CQ1  +  a<Kl,a,N)  ; 


*•*  5  CQM  +  a<KM,a,N)} 


for  i 


(9.1) 
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Theorem  9.1  shows  that  a  special  type  of  control  limit  rule  Is  optimal 
for  this  system. 

Theorem  9.1.  If  assumptions  (l)-(4)  hold,  then  there  exists  an  i*  for 
which  it  is  optimal  to  replace  whenever  i  i*,  and  to  either  repair  or 
do  nothing  whenever  i  >  i*  for  both  the  discounted  and  average  cost 
optimality  criteria. 


Proof :  Since  and  are  nonincreasing  in  i,  it  is  clear  that 

4»(i,a,0)  is  nonincreasing  in  i  from  Equation  9.1.  Assume,  inductive¬ 
ly,  that  <Ki,a,N)  is  nonincreasing  in  i.  Then,  since 
vM 

Aj^O  P^j  a»N)  is  nonincreasing  in  i  by  assumption  (3),  <Ki,<*,N) 

is  nonincreasing  in  i  *  N.  Thus,  <«i,<i,N)  is 

nonincreasing  in  i. 


M 

<Ki,a)  -  min  {A  +  a  J  p  Mi,  a)  ;  C  +  oi«M,a)  ; 

j=0 

•••  C.u  +  aiRM.a.N))  for  i  >  0 

J-M 

=  min  {C  +  a<KM,a)  ;  •••  ;  CQM  +  a<KM,a)}  for  i  =  0 

Note  that,  in  the  above  recursion,  C  +  ct<KM,a)  is  constant  in  i 

while  all  other  terras  are  nonincreasing  in  i.  Thus,  if 

'Ki,«)  *  C  +  a<KM,a)  for  some  i  =  i*,  then  <Ki,a)  ■  C  +  a<KM,oc) 

v  i  <  i*.  If  C  +  a<KM,a)  never  minimizes  <Ki»a)>  set  i*  »  -1. 

This  completes  the  proof  in  the  discounted  case.  In  the  average  cost 

case,  take  $(i)  -  lim  (l-a)<Ki,a),  and  apply  the  usual  limiting 
a-*T 

argument. □ 
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The  ideas  contained  in  Chapter  7  can  be  used  to  extend  this  result 
as  shown  by  the  following  corollaries. 

Corollary  9.2.:  If  costs  A^  are  paid  during  repair  and  replacement, 
the  control  rule  described  in  Theorem  9.1  is  still  optimal. 

Proof :  The  recursion  for  i  >  0  becomes 

M 

4>(i,a,N+l)  =*  A  +  min  {a  J  P^.  <K  j  ,  a,N)  ;  C  +  acKM.a.N)  ; 

j=0  J 

•••  ;  +  aiKM,  a,N) } 

Since  the  minimization  term  is  Equation  (9.1)  with  A^  *  0  v  i ,  the 
result  follows  from  Theorem  9.1.D 

Corollary  9.3:  Let  p^  be  the  probability  that  repair  C^  suc¬ 

ceeds,  and  let  p  be  the  probability  that  replacement  succeeds.  If 
cos t s  A^  are  not  paid  during  replacement,  then  the  control  limit  rule 
described  in  Theorem  9.1  is  still  optimal  for  both  <Ki,a)  and  4>(i). 

Proof ;  The  standard  recursion  becomes: 

M 

4>(i,  a,N+l)  =  min  {A  +  a  l  p..<Kj,a,N)  ;  C  +  ap4-(M,a,N) 

J-0  J 

+  a(l-pH(i,a,N)  ;  Cf  l+l  +  ap  ^(i+l  ,a,N) 

+  a(l-pij  )<Ki>  <*,N)  ;  •••  ;  C^  +  ap^tKM.a.N) 

+  a(l-pM1)(Ki,a,N)}  i  >  0 
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-  min  {C  +  a.  p  <KM,a,N)  +  a(  l-p)<KO,a,N)  ; 

CQM  +  apM+I<KM,a,N)  +  a(  1-p^j ) 4»(0 , a,N) }  i  *  0. 


As  in  Theorem  9.1,  everything  is  nonincreasing  in  i  so,  <Ki,a)  is 
nonincreasing  in  i.  As  in  Theorem  7.4,  if  the  replacement  action  is 
optimal  for  state  i*  ,  <Ki*,a)  =  [C  +  ap<KM,a,N)  ]/ [  l-a(l-p)  ) .  Since 
<Ki,a)  is  nonincreasing  in  i  and  <Ki*,<*)  is  independent  of  i, 
<Ki,cc)  *=  cl(i*,a)  v  i  <  i*.  If  replacement  is  never  optimal,  set  i*  = 
-1.  The  usual  limiting  argument  yields  the  result  for  <Ki).  If 


restoration  costs  are  not  paid  when  restoration  fails,  replace  C  by 
pC  and  replace  C^.  by  p^_^C^,  an<*  c^e  anaIysfs  Is  unchanged.  □ 


With  C^  =  oo  v  i,k,  the  model  in  this  section  is  the  same  as 
those  in  Chapter  7.  Thus,  from  Example  7.2,  a  control  limit  rule  is  not 
necessarily  optimal  in  the  failure  to  replace  case  when  state  occupancy 


cost  A^  are  paid  during  replacement. 

The  repair/replace  model  can  also  be  extended  to  the  continuous 


case.  The  problem  is  considered  as  a  CTMDP.  Costs  C,C^,  and  be¬ 

come  cost  rates.  It  is  assumed  that  the  system  does  not  operate  during 


restoration  and  that  state  occupancy  costs  are  not  paid  during  restora¬ 
tion.  The  action  set  is  {0  =  do  nothing,  1  =  replace,  2  =  repair  i  -*■ 
l+l ,  ••*,  M-i+1  =»  repair  i  ♦  M}.  Transitions  from  state  i  to  state  j 


occur  at  rate  A  ( j  <  i)  when  no  restoration  occurs.  The  replacement 
rate  is  A,  and  the  repair  rate  from  state  i  to  state  j  is  A  (j  >  i) 
Some  kind  of  restoration  must  be  performed  in  state  0,  Let 


y  »  Aj(a).  The  rewards  and  transition  probabilities  for  an 
equivalent  DTMDP  are  as  follows  (from  Theorem  7.6). 
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rt(0)  = 

rt(l)  » 
rt(j)  = 


A±/  (  ^-Y) 

cl ( 3+y) 


c  /O+y) 
l.i+J-l  T 


for  j  >  1 


X. .  /  y 


V°> 


for  j  <  i 
for  j  =  i 


for  j  >  i 


p  i j  ( 1 )  =  |  1  '  X/y 


for  j  =  M 
for  j  =  i 
for  j  4  i,M 


)  1, i+k- 1 

V  0 


for  j  =  i+l ;k  >  1 


Pij(k)  =  )  1  "  \,i+k-l/Y  £or  j  =  i;k  >  1 


for  j  4  i,i+l;k  >  1 


Corollary  9.4:  Let  assumptions  (I)  and  (2)  remain  valid,  and  Let  as¬ 
sumptions  (3)  and  (4)  be  replaced  by 


(3'  )  X  _>  X  -v  j  <  i  <  A  or  nonincreasing  in 

i  v  k. 


<*’>  \i  Xi  i 

i  •  •  •  i 

V  ’  n 


X.  >  0  for  some  sequence 

v° 


Then  the  control  limit  rule  described  in  Theorem  9.1  is  optimal  in  both 
the  discounted  and  average  cost  cases. 


Proof:  The  recursion  for  the  DTMDP  with  i  >  0  is 
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M 

<Ki,a,N+l)  =■  min  {A  /(p+y)  +  a  £  p  (0)<Hj,a,N)  ;  C/(P+y) 

j-0  J 

+  «X<KM,a,N)/y  +  a(l-\/y)cKi,  a,N)  ;  Ct  1+1/(0+y) 
+  aXi  i+1 4>(i+l » a,N)/ y  +  a(l-Xi  1+J  / y)«|»(i  ,  a,N)  ; 

***  ;  CiM^^Y*  +  aX1>I<KM>(:[>N)^ 


+  ad-X^/ y)<Ki ,  a,N) } 


rM 

From  Lemma  7.7,  Z-_q  (0)  <K  j  » <x,N)  is  nonincreasing  in 

(k) 

sumption  (4')  means  that  P^g  ®  for  some  k.  Thus, 

p  =  /y  and  p  =  X/y,  this  recursion  is  the  same  as 

K  1 ^ ItK 

Corollary  9.3. □ 


i,  and  as- 
with 
that  in 


9.2.  Examples 

The  control  limit  rule  in  Section  9.1  differentiates  only  between 
replacement  and  the  other  alternatives.  It  would  be  nice  to  show  that 
there  is  a  region  of  the  state  soace  for  which  repair  is  optimal  and  a 
region  for  which  inaction  is  optimal.  In  state  M  it  must  be  correct 
to  do  nothing  since  otherwise  the  system  would  be  in  a  constant  state  of 
repair.  From  the  previous  section,  it  is  optimal  to  replace  in  states 
0,1,  ...,i*.  Thus,  in  states  i*+l , . . . ,M- 1 ,  the  optimal  action  will  al¬ 
ternate  between  repair  and  inaction.  A  "nice"  policy  is  one  in  which 
the  optimal  action  alternates  infrequently  such  as  t he  3-region  and  4- 
region  policies  shown  in  Figure  9.1. 
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States 

0,  . . . ,i* 

i*+l ,  .  . .  ,i'-l 

i' . M 

■MM 

Optimal  Action  for 
3-Region  Policy 

Replace 

Re  pair 

Inaction 

Optimal  Action  for 
4-Region  Policy 

Replace 

Inaction  for  Ki^ 
Repair  for  i>i() 

i*  <  i0  <  i"1 

Inaction 

Figure  9.1.  3-Region  and  4-Region  Policies 


Example  9.1  is  a  4-region  policy.  Policies  of  this  type  were  found 
to  be  optimal  in  Rosenfield  [1976A]  when  the  possible  actions  were  re¬ 
placement,  inspection,  and  inaction.  Unfortunately,  as  shown  in  Example 
9.2,  a  4-region  policy  is  not  necessarily  optimal  for  the  repair/replace 
model.  Since  Example  9.2  contains  a  linear  utility  function,  a  totally 
positive  transition  matrix,  and  repair  only  from  state  i  to  state  i+1, 
it  is  difficult  to  think  of  hypotheses  which  would  make  a  4-region  poli¬ 
cy  optimal.  It  is  hypothesized  that  a  counter-example  can  be  devised 
for  any  n-region  policy  where  n  is  a  finite  number. 

Example  9.1:  Consider  the  4-state  DTMC  shown  in  Figure  9.2. 


Figure  9.2.  DTMC 
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Repair  is  restricted  to  repair  from  state  1  to  state  1+1.  Let 

A3  =  ~3,  A2  =  A1  “  _2,  A0  *  °*  C  “  6*  Ci,i+1  "  4  *  11  P33  =  *8,  P32  = 

.1,  p^  =  .1,  p^  =  .1,  P£q  *  .9,  p  *  1,  and  a  =  .9.  The  recursions 

in  the  discounted  case  are  shown  below. 

4<(3,a)  =  +  a(p334>(l,a)  +  P324<(2,a)  +  p^cKl.a)] 

<K2,a)  =  rain  {A2  +  alp^iKl.a)  +  p20<|»(0,a))  ;  C23  +  atK3,a)  ; 

C  +  a<K3  ,  a) } 

<Kl,a)  =  rain  {A^  +  arp^4)(0,a>  ;  C  2  +  a<K2,a)  ;  C  +  aiK3,a)} 

4<(0,a)  =  min{C0i  +  onX  1 ,  a)  ;  C  +  a<K3,a)} 

Solving  the  recursions  yields: 

(K3,a)  -  ^A3+ap32C23+ap31Al+a2p31C^^1'‘a:p33_a2p32_a3p31^  =  ~18-51 

<K2,a)  =  rain  {-11.68;  -12.66;  -10.66}  =  -12.66  (repair) 

<K  1 ,  a)  =>  min  {-11.6;  -7.4;  -10.66}  =  -11.6  (do  nothing) 

<K0, a)  =  min  {-6.44;  -10.66}  =  -10.66  (replace) 

Thus  the  optimal  actions  are  inaction  in  states  1  and  3,  repair  in 
state  2,  and  replacement  in  state  0  as  shown  in  Table  9.1.  This  is  a 
4-region  policy.  0 


Table  9.1.  4-Region  Policy 
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Figure  9.3.  DTMC 


The  only  allowable  repair  is  from  state  i  to  state  i  +  1.  Let 
*4  -  -1  .  t.  C  .  7,  Ct  1+1  -  1.95  v  1. 

and 


0 

.01 

.899 

.1 

0 

0 

0 

.01 

.99 

0 

p  - 

0 

0 

0 

.99 

.1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

The  transition  matrix  is  obviously  upper  triangular  and  can  be  shown  to 
be  totally  positive.  The  optimality  criterion  is  expected  average  cost 
which  is  computed  from  the  equation  E(average  cost)  ■  E(cost  per  cycle)/ 
E(cycle  length). 


The  return  state  is  the  largest  state  that  the  process  will  return 
to  infinitely  often  (normally  state  4).  In  Table  9.2  all  permissable 
policies  are  considered.  The  table  has  been  abbreviated  by  using  the 


control  limit  rule  in  Section  9.1.  always  doing  nothing  in  state  4,  and 
choosing  only  between  repair  and  replacement  in  state  0.  Dashes  appear 
in  the  table  whenever  the  chosen  action  is  immaterial  to  the  calculation 
of  expected  average  cost.  It  can  be  seen  from  Table  9.2  that  the  policy 
yielding  the  lowest  expected  average  cost  is  inaction  in  states  2  and 
4,  repair  in  states  1  and  3,  and  replacement  in  state  0.  This  might 
be  called  a  5-region  policy. □ 


State 

L‘ 

3 

2 

1 

0 

E(Average  Cost) 

N 

RR 

RR 

RR 

- 

.03 

N 

RR 

RR 

RL 

- 

.072 

N 

N 

RR 

RR 

- 

.294 

N 

N 

RR 

RL 

- 

.54 

N 

RL 

RL 

RL 

- 

1.75 

N 

N 

RL 

RL 

- 

1.498 

Actions 

N 

RR 

RL 

RL 

- 

1.497 

N 

N 

N 

RL 

RL 

.413 

N 

RR 

N 

RL 

RL 

.413 

N 

N 

N 

N 

RL 

.053 

N 

N 

N 

RR 

RL 

-.018677 

N 

N 

RR 

N 

RL 

.413 

N 

RR 

N 

RR 

RL 

-.018681 

N 

RR 

N 

N 

RL 

.053 

- 

- 

- 

N 

RR 

.475 

- 

- 

N 

RR 

RR 

-.015 

N 

RR 

RR 

N 

RL 

.031 

N  -  Do  Nothing 
RR  *  Repair 
RL  »  Replace 


Table  9.2.  Policy  Comparison 
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APPENDIX 


Proof  of  Lemma  8.3: 


First  it  is  shown  that  if  (i)  and  (il)  hold,  then  p 


Froa  the  inequalities  for  A^ 


and  A, 


X1A  XjA 


JA 

where  j  <  r  <  i. 


j+l.A 


it 

•  •  ■  A 


iA ' 


JA  IA 

Xj*  "  Vl.A  *  “  Xi* 

For  each  state  r 


rA 


k.  +  ^  Vt^lA^r* 
r  •  Kk  rx  **  r 


(lr*  +  \Aa 


i 

1  +  ^  Xri^M  +  ^  Nri^iA 

CKKj 

KKk 


where 


Xr*  *  l  \l 
CKKj 

i<l 


One  solution  to  these  equations  is 


rA 


(X  +  I  \..»SaV\ 


OCJKj 

i?JKk 


rf'xA '  r*  * 


This  solution  is  constant  in  r  v  j  ^  r  <  i  and  oust  be  unique 
since  the  system  is  a  positive  recurrent  irreducible  CTMC.  Thus, 

V . ^iA* 

Now  assume  that  at  least  one  of  (!)  and  (ii)  does  not  hold.  It 
will  be  shown  by  induction  on  the  size  of  the  state  space  that 
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^OA  >  ^LA  >  ***  >  *\c-l  a  V  A*  For  M  *  l»  the  lemma  18 
obvious.  Assiaae  p^  >  •••  >  p^_j  A  *  A  and  -v  M  »  1,  2,  ...,  N. 

Consider  a  set  A  -  {k,  ....  tH-l)  with  k  <  N  as  a  single  state  by 

ull.  1 

setting  X...  -  .  X.,  for  all  i  It  A.  The  state  space  then 

1A  X*  K  IX 

has  fewer  than  N  states,  so  the  induction  hypothesis  iaplies 
that  p^  >  •••  >  Pjj_1  A*  Thus,  it  suffices  to  show  that 

PQ  N+l  >  ***  >  ^J.N+l*  Each  ,N+1  18  a  continuous  function  of  X^ 
v  A,m  when  X^  >  0.  From  the  inequalities  on  the  transition  rates, 

yN+,  >  «•  Ut  *•  «•  ">d  let  \,N+1  *  -•  Ihe"  “jA*  dj ,N+1 

V  j,  aid  ^  >  •••  >  Vi, A  or  “o.M+I  >  "•  >  Vl.N+1  fr°" 

the  induction  hypothesis.  Also,  p^  +  0  as  X^  N+1  ♦  ®,  so 

^N.N+l  <  Vl.N+1' 

By  continuity,  since  the  p,  m-.'s  are  strictly  ordered  for 

J  ,N+l 

X„  „ ,  ,  -*  they  must  also  be  strictly  ordered  for  some  large  finite 
N,N+ 1 

value  of  X^  N+^.  Start  with  Xj^  fJ+1  at  that  large  value  and  decrease 
it  toward  its  original  value.  If  it  reaches  its  original  value  with  the 
Pj  N+j'8  still  strictly  ordered,  the  proof  is  finished.  If  not,  let 
X^  be  the  largest  value  of  X^  for  which  an  equality  occurs. 

Let  e  be  a  small  positive  number.  By  continuity,  for  some  j  and 

i  >  j: 


XN,N+1  “  Nl.N+l  +  E  ^j-l.N+l  >  ^j,N+I  >  >  ^i,N+l  >  ^i+l  ,N+1 

XN,N+1  "  Ni.tffl  >  ,  N+l  "  *'*  ”  ^i,N+l  >  **1+1, N+l 

Yn+1  ”  M.NH  "  C  ^i+l,N+I  <  ^,Hfl  <  ***  <  ^i,N+l  <  ^j-l,N+l  * 
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Conditioning  on  the  first  jump  from  state  j  yields: 
“’“j.Wl  ‘  ■<J)  +  »jl  “l.W-l 


where 


Vj,j+1 


X«-  l  Xi  L 

J  #j+l,N+l  J 


<Xj*  +  Xj,j+1  +  ^.N+l^j.N+l 


1  + 


,»H  +  JN  Xjt  ^,N+1 
#j7j  +  l 


(2)  A  similar  calculation  for  state  j+1  yields: 


*  +  N+l.i  +  X 


j+1,*  j+l,J  j+1 ,N+1  j+1 ,N+1 


1  +  Vl.j^j.N+l  +  JN  Vl,*  "i.N+l 


where 


J+1’*  J+1’* 


(A.i) 


(A. 2) 


Subtracting  Equation  (A. 2)  from  (A.I): 
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(V  *  Xj.J+i  +  + 

-  <xj+i,* +  Vu  +  Vi.w  +  ^.j+^Vi.w-i 
+  ^  (xii '  Vm)|1*.»h  ’ 

KN  J 


Wlth  Yn+1  •  *H,W-i  80  that  Yn+1  "  Vl,N+l 


,  this 


equation  becomes 


(Xj*  +  Xj,N+l)Vij,N+L 

"  (Xj+lf*  +  Xj+1,N+1)  Vl.N+1  +  Z<N  UJA~  Vl.A)YN+l 


JWj.j+1 


Since  Xj  N+1  <  Xj+1N+1  . 

V  “j,m  2.  Vi.*  Vi.w  +  J„  (V  ■ 

Vj.j+l 

where  equality  holds  if  and  only  if  ,n+1  ™  Xj+1,N+1* 

Rewriting  Equation  (A. 3)  using  the  definitions  of  \jit  and 


(A. 3) 


L  xjX(tij,m‘,iA.N+i)  -  JN  J+l 


>  i  l  or 


KN 

JVj.J+l 


A<N 

Vj.J  +  l 


l  V^'W  +  ,<L 


(A.4) 


193 


' 


where  Pj  ^  -  •••  *  was  used  to  derive  Equation  (A.4).  For 

*  <  j*  ^J.N+l  ”  ^*,N+1  *  Vl.N+l  ~  ^,N+1  *  °  Snd  XjJl-  Xj+1.*' 

For  l  >  1,  ~  ^>N+i  “  ^j+l,N+l  "  ^Jl.N+l  >  0  and  XH  ~  Xj+l,r 

Thus,  each  term  on  the  left  hand  side  of  Equation  (8.4)  Is  smaller  than 

or  equal  to  the  corresponding  term  on  the  right  hand  side.  The  only  way 
that  Equation  (A.4)  can  be  valid  is  if  A  #  *  A  .  -v  0  <  1  <  j-1, 
i+1  1  <  N,  in  which  case  it  holds  at  equality.  From  Equation  (A. 3)  , 

A  ■  A  .  Thus,  conditions  (i)  and  (ii)  hold  with  i  -  j+1, 

j  9  NT  1  jT  1  ,  Nr  1 

A  -  {NH}. 

Now  repeat  the  above  calculations  for  pairs  of  states 
(j+1,  j+2) ;  ;  (1+1,1).  For  Equations  (A. 3)  and  (A.4)  (appropriately 

modified)  to  remain  valid,  it  must  be  that  A  «  •••  -  A  v 

J  " 

0  £  1  j-l  and  i+1  _<  1  <  NH.  However,  this  is  equivalent  to  (i)  and 
(ii)  with  A  ■  {NH}  which  is  a  contradiction  since  it  was  assuned  that 
at  least  one  of  (i)  and  (ii)  did  not  hold.  Thus,  there  does  not  exist 
Nl  N+l  such  tllat  N+1  *  *•*  *  and  continuity  implies 

th“  ■‘0.1H-1  >  •"  >  V«+l- 

To  prove  the  result  for  set  B  ■  {0,  1,  ...,  k),  let  state  i 

( 

correspond  to  state  M-l  in  the  previous  argument.  Then  set  B 
becomes  set  A,  and  the  result  follows.  0 


Lemma  A.  1 :  Assume  ljmQ  *-8  nonincreasing  in  i  v  i  <  i 

and  l^mi  is  nondecreasing  in  i  ■v  1  >  i.  Let 

0  <  *o  <  <  -m  and  b0  i  bl  i  i  \  <  °*  Then  ^J-JlXijaj 

and  Xijbj  are  aon<*ecr«a#ing  in  i  v  Jl  >  i  and  w  i  <  i, 

respectively.  If  (a^}  is  a  strictly  increasing  sequence,  then 
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*  1  >  *•’  ?-l  Xi+l,jaJ  “  Xijaj  <-4  Xi+l,j  "  Xij 
v  j  -  1,  M.  If  {bj }  is  a  strictly  increasing  sequence,  then 

V  1  K  i*  h-0  Xljb4  “  Ej-0  Xi+l,jbj  <m*  Xij  “  Xi+l,j 

v  j  *  0,  « . « ,  -l. 


Proof:  Let  A.  = 


^  =  ai~ai_I  >  0  v  i  >  i,  and  let  =  a^  _>  0, 


M 


M  M 


M  M 


^  J  (Vm  rV\  ‘  1  \  1  (Vi  rV  • 

j-k  1  1,J  J  K  k«A  j-k  11  1J 


(A. 5) 


By  hypothesis,  Ij,k( *1+1  f j~Xij >  >  °*  80  slnce  \  1  °»  l” L  i  Xijaj 
is  nondecreasing  in  i.  If  A^  >  0,  equality  in  Equation  (A. 5)  can  hold 

if  and  only  if  xi+l  ^  “  0  *  k  “  * . M  or  Xi+l,j  “  Xij 

*  j  =*  l . M. 

Let  =  bt  -  b^+1  <  0  *t  i  <  A,  and  let  B^  =  <  0, 


j-o 


(Vi.rV“3 


i  i  <\+l  rV\  •  i  \  J  <\+i  ,-v  • 

k=0  j“0  1  K  k«0  j=0  °  J 

(A. 6) 


By  hypothesis  Ij»0(  Xi+1  ,j“Xij J  -  °»  80  since  Bk  -  °*  ^j-0  Xijbj 
is  nondecreasing  in  i.  If  B^  <  0,  equality  in  Equation  (A. 6)  can  hold 

if  and  only  if  X*,0<  \+L  ,j'Xi;)  >  "  °  *  k  "  ° . *  °r  Vl.j'Sj 

^  j  *  0,  •••»  1 •  D 


Proof  of  Lemma  8.9:  The  proof  of  Lemma  8.9  is  identical  to  the  proof  of 
Lemma  8.3  except  at  the  places  where  hypotheses  on  the  X^'s  are 
Invoked.  The  necessary  changes  are  shown  herein,  but  the  entire  proof 
is  not  repeated. 
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Since  X  ^  =•  X^  for  j  <  i  does  not  necessarily  imply 

Xjj  -  •••  ■  X^  in  Lemma  8.9,  it  is  assumed  that  X^  ^  *  •••  *  X^  in 

condition  (i)  and  X  =  •••  =  X  in  condition  (ii).  Note  that 

j  ,A  iA 

Xj,N+l  -  Xi,N+l  v  J  <  1< 

Consider  Equation  (A. 4) .  For  l  <  j,  N+1  -  p^  N+1  = 

Vl.N+1  '  ^.N+l  <  °’  and  by  Lemma  (A*U 

^  XjA(^j  .N+l'^A.N+l5  -  ^  Vl,*(Vl,w‘|l*,IW)  * 


where  equality  holds  if  and  only  if  .  -  \  .  .  *  i  *  0,  ...»  j-1, 

j  *•  ^ 

F°r  1  >  *’  ^,N+1  "  ^l.N+1  =  Vl.N+1  '  “l.N+l  >  °*  and  by  Lemma 

(  A.  1 ) 


iJ<N  XJ*(^’N+1  -i<A<N  XJ+1,*(  Vl.N+r^A.N+P  * 


where  equality  holds  if  and  only  if  X ^  ^  *  X^+1  ^  -v  l  *  i+l,...,N. 
Thus,  Equation  (A. 4)  is  not  valid  unless  conditions  (i)  and  (ii)  hold, 
and  the  same  contradiction  exists  that  exists  in  the  proof  of  Lemma 
8„3.  D 


Proof  of  Lemma  8.11.  The  proof  of  this  lemma  is  identical  to  the  proof 

of  Lemma  8.3  to  the  point  at  which  the  calculations  of  p.  and 

J  *N+I 

**j+l  N+1  begin  (Equations  (A.l)  and  (A. 2)).  This  proof  begins  at  that 
point  with  X  >0  and  X  >  0  for  j  <  r  ^  s. 

J  *  J  *  *  9  S 


u>  “j.»«  ■  ■»>  +  ,1  V  “».i 


+  V N+1  +  Xrttlt,N+l^(S‘+>'jr) 
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r 


where 


Xj*  H  ^  Xjjl  * 

J  Kj  J 


(X 


y+V)^j.N+1  “  1  +  ^jr^r.N+l  + 


(2)  A  similar  calculation  for  j+1  yields: 


(Xj  +  l,*+Xj+l,j+Xj  +  l,s)kij+l,I«-l 

=  1  +  Vl.j^J.N+l  +  Vl.s^s.N+I  +  Xj+1,X^,N+1 


where 


Xj  +  1.*  "  Vl,*  * 


Subtracting  Equation  (A. 8)  from  (A. 7): 


( V+Xjr+Xj  +  l,j)tlj,N+l  '  V^r.W-l 


(Xj  +  l,*+Xj+l.s+Xj+l,j)^+l,N+l  ‘  Xj+1 , s ,N+1 
+  Kj 


^■e  ^  ^fJ^JJO.1  “  Nu  MO-1  S0  that  KLL!  “ 


,N+1  “  ^N.N+l  ‘•“aL  **j  ,N+1  l*j+l  ,N+1  * 

Rewriting  Equation  (A. 9)  using  the  definitions  of  X  and  X  , 

J*  J+1 

yields: 


1 


(A. 7) 


(A. 8) 


(A. 9) 


r 


For  X  <  j,  N+1  -  ^j+i  ,N+1  <  UX,N+1*  and  XjX  -  Xj+l,X*  80 


A|j  XjX(,ij  .N+T^X.N+l^  -  Xj+l,X(|ij+l,N+l_MX,N+l)  <  ° 


where  equality  holds  if  and  only  if  X. .  =*  X  ,  .  for  X  <  j.  Since 

jX  j+l,X 


Xjr  —  Xj+l,s  and  ^r,N+l  -  ^s,N+r 


Xjr(Mj  .N+i'A.N+l*  -  Xj+1 (  +  l  ,N+1) 


where  equality  holds  if  and  only  if  s  <  i  or  X.  =  X.  ,  and  =  s 

—  Jr  j+l.s 

Thus,  Equation  (A. 10)  can  hold  if  and  only  if  X. .  =  \  .  v 

j  x  j+ 1 ,  x 

X  *  0,  ....  j-1  and  X  “  i+1 ,  ....  N+1.  Repeating  the  argument 
pairs  of  states  (j+l,j+2);  (i-l,i),  the  appropriate  modifier  .jns 

of  Equations  (A. 9)  and  (A.  10)  hold  if  and  only  if  X  ^  ■  X^A  -v 

*  “  0 . j-1  and  X  -  i+1 . n+1.  Thus,  pQ  N+J  >  •••  >  N+1 

when  conditions  (i)  and  (ii)  do  not  simultaneously  hold. 

To  prove  the  result  for  set  B  *  {0,  l,  ....  k},  let  state  i 
correspond  to  state  M-i  in  the  previous  argument.  □ 


198 


BIBLIOGRAPHY 


Barlow,  R.  E.  and  F.  Proschan  [1975AJ,  Statistical  Theory  of  Reliability 
and  Life  Testing,  Holt,  Rinehart,  &  Winston,  Inc.,  New  York. 

Barlow,  R.  E.  and  F.  Proschan  [1975B],  "Importance  of  System  Components 
and  Fault  Tree  Events,"  Stochastic  Processes  and  their  Applica¬ 
tions  ,  Vol.  3,  pp.  153-173. 

Barlow,  R.  E.  and  A.  S.  Wu  [1978],  "Coherent  Systems  with  Multistate 
Components,"  Math.  Oper.  Res.,  Vol  3,  pp.  275-281. 

Bergman,  8.  [1978],  "Optimal  Replacement  under  a  General  Failure  Model," 
Adv .  Appl .  Prob .  ,  Vol.  10,  pp.  431-451. 

Birnbaum,  Z.  W.  [1969],  "On  the  Importance  of  Different  Components  in  a 
Multicomponent  System,"  Multivariate  Analysis-II,  ed.  by  P.  R. 
Krishnaiah,  Academic  Press,  New  York,  pp.  581-592. 

Birnbaum,  Z.  W.  and  J.  D.  Esary  [1965],  "Modules  of  Coherent  Binary  Sys- 
tems,"  SIAM  J.  Appl.  Math,  Vol.  13,  pp.  444-462. 

Birnbaum,  Z.  W.,  J.  D.  Esary,  and  S.  C.  Saunders  [1961],  "Multicomponent 
Systems  and  Structures,  and  their  Reliability,"  Technometrics , 

Vol.  3,  pp.  55-57. 

Bodin,  L.  D.  [1970],  "Approximations  to  System  Reliability  using  a  Modu¬ 
lar  Decomposition,"  Technome tries ,  Vol.  12,  pp.  335-344. 

Butler,  D.  A.  [1979],  "A  Complete  Importance  Ranking  for  Components  of 
Binary  Coherent  Systems,  with  Extensions  to  Multistate  Systems," 
Nav.  Res.  Log.  Quart.,  Vol.  26,  pp.  565-578. 

Butler,  D.  A.  [1982],  "Bounding  the  Reliability  of  Multistate  Systems," 
Oper.  Res. ,  Vol.  30,  pp.  530-544. 

Denardo,  E.  V.  [1967],  "Contraction  Mappings  in  the  Theory  Underlying 
Dynamic  Programming,"  SIAM  Review,  Vol.  9,  No.  2,  pp.  165-177. 

Derman,  C.  [1962],  "On  Sequential  Decisi' ns  and  Markov  Chains,"  Man . 

Set. ,  Vol.  9,  No.  1,  pp.  16-24. 

Derman,  C.  [1963],  "On  Optimal  Replacement  Rules  when  Changes  of  State 
are  Markovian,"  Mathematical  Optimization  Techniques,  ed.  by  R. 
Bellman,  Chapter  9 , 'University  of  California  Press,  Berkeley. 

Derman,  C.  [1970],  Finite  State  Markovian  Decision  Processes,  Academic 
Press,  New  York. 

El-Neweihi,  E. ,  F.  Proschan,  and  J.  Sethuraman  [1978],  "Multistate 
Coherent  Systems,"  J.  Appl.  Prob. ,  Vol.  15,  pp.  675-688. 


-199- 


Eppen,  G.  (1965],  "A  Dynamic  Analysis  of  a  Class  of  Deteriorating 
Systems,"  Man.  Sci. ,  Vol.  12,  No.  3,  pp.  223-240. 

Esary,  J.  D.  and  F.  Proschan  [1963],  “Relationship  between  System 

Failure  Rate  and  Component  Failure  Rate,"  Technometrics ,  Vol.  5, 
pp.  183-189. 

Esary,  J.  D.  and  F.  Proschan  [1970],  "A  Reliability  Bound  for  Systems  of 
Maintained,  Interdependent  Components,"  J.  Amer.  Stat,  Assoc., 

Vol.  65,  pp.  329-338. 

Esary,  J.  D. ,  F.  Proschan,  and  D.  W.  Walkup  [1967],  "Association  of 

Random  Variables,  with  Applications,"  Ann.  Math  Statist.,  Vol.  38, 
pp.  1466-1474. 

Fardis,  M.  N.  and  C.  A.  Cornell  [1981],  "Analysis  of  Coherent  Multistate 
Systems,"  IEEE  Transactions  on  Reliability,  Vol.  R-30,  No.  2,  pp. 
117-122. 

Fussell,  J.  B.  [1975],  "How  to  Hand-Calculate  System  Reliability  Charac¬ 
teristics,"  IEEE  Transactions  on  Reliability,  Vol.  R-24,  No.  3, 
pp.  169-174. 

Gottlieb,  G.  [1982],  "Optimal  Replacement  for  Shock  Models  with  General 
Failure  Rate,"  Oper .  Res. ,  Vol.  30,  No.  1,  pp.  82-93. 

Griffith,  W.  S.  [1980],  "Multistate  Reliability  Models,"  J.  Applied 
Prob. ,  Vol.  15,  pp.  735-744. 

Hadar,  J.  and  W.  R.  Russell  [1969],  "Rules  for  Ordering  Uncertain  Pros¬ 
pects,"  American  Econ.  Rev.,  Vol.  59,  No.  1,  pp.  25-34. 

Hatoyama,  Y.  [1979],  "Reliability  Analysis  of  3-state  Systems,”  IEEE 
Transactions  on  Reliability,  Vol.  R-28,  No.  5,  pp.  386-393. 

Hirsch,  W.  M. ,  M.  Meisner,  and  C.  Boll  [1968],  "Cannibalization  in  Mul¬ 
ticomponent  Systems  and  the  Theory  of  Reliability,"  Nav.  Res.  Log. 
Quart. ,  Vol.  15,  pp.  331-359. 

Hochberg,  M.  [1973],  "Generalized  Multicomponent  Systems  under  Canni¬ 
balization,"  Nav.  Log,  Res.  Quart.,  Vol.  20,  pp.  585-605. 

Kalymon,  B.  [1972],  "Machine  Replacement  with  Stochastic  Costs,"  Man. 
Sci.:  Theory,  Vol.  18,  No.  5,  pp.  288-298. 

Kao,  E.  [1973],  "Optimal  Replacement  Rules,"  Oper.  Res.,  Vol.  21,  No.  6, 
pp.  1231-1249. 

Klein,  M.  [1962],  "Inspection-Maintenance-Replacement  Schedules  under 
Markovian  Deterioration,"  Man,  Sci.,  Vol.  9,  No.  1,  pp.  25-32. 

Kolesar,  P.  [1966],  "Minimum  Cost  Replacement  under  Markovian  Deteriora¬ 
tion,"  Man.  Sci.,  Vol.  12,  No.  9,  pp.  694-706. 


-200- 


Lambert,  H.  E.  [1975),  "Measures  of  Events  and  Cut  Sets  in  Fault  Trees," 
Reliability  and  Fault  Tree  Analysis,  SIAM,  Philadelphia,  pp. 

77-100. 

Luss,  H.  [1976],  "Maintenance  Policies  when  Deterioration  can  be  Ob¬ 
served  by  Inspections,"  Oper.  Res.,  Vol.  24,  No.  2,  pp.  359-366. 

Postelnicu,  V.  [ 1970 ] ,  ‘ Nondichotomic  Multicomponent  Structures,"  Bull . 
Math,  de  la  Soc.  Sci.  Math,  de  la  R.  S.  de  Roumanie,  Vol.  14,  No. 

2,  pp.  209-217. 

Rosenfield,  D.  [1976A],  "Markovian  Deterioration  with  Uncertain  Informa¬ 
tion,”  Oper .  Res.  ,  Vol.  24,  No.  1,  pp.  141-155. 

Rosenfield,  D.  [1976B],  "Markovian  Deterioration  with  Uncertain  Informa¬ 
tion  -  a  More  General  Model,"  Nav.  Res.  Log.  Quart. ,  Vol.  23,  pp. 
389-405.  ' 

Ross,  S.  [1969],  "A  Markovian  Replacement  Model  with  a  Generalization  to 
Include  Stocking,"  Man.  Sci:  Theory,  Vol.  15,  No.  11,  pp.  702-715. 

Ross,  S.  [1979],  "Multivalued  State  Component  Systems,"  Ann.  Prob. ,  Vol. 
7,  pp.  379-383. 

Serfozo,  R.  [1979],  "An  Equivalence  between  Continuous  and  Discrete  Time 
Markov  Decision  Processes,"  Oper.  Res.,  Vol.  27,  No.  3,  pp.  6 16— 
620. 

Simon,  R.  M.  [1972],  "The  Reliability  of  Multicomponent  Systems  Subject 
to  Cannibalization,"  Nav.  Res.  Log.  Quart.,  Vol.  19,  No.  I,  pp.  1- 

14. 

Smith,  D.  R.  [1978],  "Optimal  Repair  of  a  Series  System,"  Oper.  Res., 
Vol.  26,  No.  4,  pp.  653-662. 


-201- 


security  CLASSIFICATION  of  this  RACE  ftn ion  Dal.  intotsd) 


REPORT  DOCUMENTATION  PAGE 


REPORT  NUMBER 


4  TITLE  (snd  Subtitle) 


MULTISTATE  RELIABILITY 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


j  recipient's  catalog  number 


s.  TYPE  OF  REPORT  *  PERIOD  COHERED 

Technical  Report 


t  performing  org.  report  number 


7.  AuThORCa) 


»  CONTRACT  OR  GRANT  NUMBERFaJ 


ALAN  P.  WOOD 


N00014-75-C-0561 


9  PERFORMING  ORGANIZATION  NAME  AND  AOORESS 

Department  of  Operations  Research  and  Department 
of  Statistics  -  Stanford  University 
Stanford,  California  94305 


"•  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

Operations  Research,  Code  434 
Office  of  Naval  Research 
Arlington,  Virginia  22217 


4  MONITORING  AGENCY  NAME  A  ADDRESSflt  dlllsrsnt  from  Controlling  Olllco) 


10.  PROGRAM  ELEMENT.  PROJECT.  TASK 
AREA  A  WORK  UNIT  NUMBERS 

(NR-047-200) 


tz.  REPORT  OATE 

June  1983 


ts.  NUMBER  OF  PAGES 

201 


IS.  SECURITY  CLASS,  (ol  this  r sport) 


Unclassified 


IS*.  DECLASSIFICATION/ DOWNGRADING 

schedule 


te.  DISTRIBUTION  STATEMENT  (ol  this  Rsport) 


APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  IS  UNLIMITED 


17  DISTRIBUTION  STATEMENT  (ol  ths  sbstrscl  sntsrsd  In  Block  70,  II  dlltsrsnl  Iroai  Koporl) 


U.  KEY  WORDS  (Continue  on  rovroo  old*  if  nocoooory  and  Identity  by  block  nwabar)  , 


RELIABILITY 


COHERENCE 


MULTISTATE  COMPONENTS 


MARKOV  DECISION  PROCESSES 


20.  ABSTRACT  (Continue  on  tororao  aide  If  nocoooory  end  Identify  by  block  number) 

(SEE  NEXT  PAGE) 


DD  |  JAN *71  1473  A  COITION  OF  I  NOV  AS  I*  OBSOLETE 


s/n  oioz-oia-ssoi 


_ UNCLASSIFIED _ 

SECURITY  CLASSIFICATION  OF  TniS  PACE  fBtiao  Dots  tntsrsd! 


_ UNCLASSIFIED _ 

StCUMTY  Ct_A'.SIF'CATlOH  OF  THIS  PAG*  Q«U 

ABSTRACT:  MULTISTATE  RELIABILITY 
by  Alan  P.  Wood 


>  In  modern  society  the  consequences  of  system  failure  can  be 

!  catastrophic.  The  study  of  reliability  has  evolved  from  the  desire  to 
i  prevent,  or  at  least  mitigate  the  consequences  of,  failure.  A  reliability  . 

j  analysis  is  performed  to  determine  the  probability  that  a  component  or  system  j 

i  is  able  to  perform  its  specified  function.  Two  major  topics  useful  in  that 
j  endeavor  are  considered  herein.  The  first  topic  is  the  extension  of  coherent 
j  structure  function  theory  to  components  and  systems  with  several  states.  The 

|  second  topic  is  the  optimal  maintenance  of  multistate  components. 

|  Coherent  structure  function  theory  is  an  axiomatic  approach  to 

‘  reliability  in  which  the  components  and  systems  are  binary,  i.e. ,  they  have 
|  two  states  -  operational  and  failed.  The  first  part  of  the  thesis  extends 

j  the  theory  to  components  and  systems  with  multiple  states.  This  is  useful 

j  for  modeling  systems  in  which  partial  failure  may  occur.  Multistate  coherent 
!  structure  functions  are  defined,  and  it  is  shown  that  most  of  the  binary  i 

!  results  have  multistate  analogs.  These  results  deal  with  duals,  modules,  I 

•  minimum  cut  and  path  sets,  reliability  importance,  reliability  bounds,  i 

•  closure  theorems,  fault  trees,  and  block  diagrams.  The  theory  is  further  j 

>  extended  to  allow  each  component  and  the  system  to  have  a  continuum  of 
j  states. 

I  Optimal  maintenance  policies  for  periodically  inspected  multistate 

■  components  have  previously  appeared  in  the  literature.  The  second  part  of 
the  thesis  extends  those  policies  to  continuously  monitored  equipment  by 
using  Markov  decision  processes  and  continuous  time  Markov  chains.  The  main 
theorems  are  in  the  form  of  control  limit  rules  which  state  that  it  is 
optimal  to  repair  or  replace  a  component  whenever  it  has  degraded  to  a 
certain  level.  It  is  shown  that  under  certain  assumptions  the  optimal  policy 
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